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CANINE RESPIRATORY CORONAVIRUS (CRCV) 
SPIKE PROTEIN, POLYMERASE AND 
HEMAGGLUTININ/ESTERASE 

The present invention relates to biological material, and in particular to a 
canine respiratory coronavirus that is present in dogs having canine 
infectious respiratory disease. 

5 Canine infectious respiratory disease (CIRD) is a highly contagious disease 
common in dogs housed in crowded conditions such as re-homing centres 
and boarding or training kennels. Many dogs suffer only from a mild cough 
and recover after a short time, however in some cases a severe 
bronchopneumonia can develop (Appel and Binn, 1987). 

10 The pathogenesis of CIRD is considered to be multifactorial, involving 
several viruses and bacteria. The infectious agents considered to be the 
major causative pathogens of CIRD are canine parainfluenzavirus (CPIV) 
(Binn et ah, 1967), canine adenovirus type 2 (CAV-2) (Ditchfield et al, 
1962) and the bacterium Bordetella bronchiseptica (Bemis et al, 1977, Keil 

15 et al, 1998). Also, canine herpesvirus, human reovirus and mycoplasma 
species have been isolated from dogs with symptoms of CIRD (Karpas et 
al, 1968, Lou and Wenner 1963, Randolph et al, 1993) Additional factors 
like stress may also be important. 

CIRD is rarely fatal but it delays re-homing of dogs at rescue centres and it 
20 causes disruption of schedules in training kennels as well as considerable 
treatment costs. 

Vaccines are available against some of the infectious agents associated with 
this disease, namely Bordetella bronchiseptica as well as CPIV and CAV-2. 
However, despite the use of these vaccines, CIRD is still prevalent in 
25 kennels world-wide, which is possibly due to the vaccines not providing 
protection against all the infectious agents involved in CIRD. 
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We have discovered a novel coronavirus, which we have called canine 
respiratory coronavirus (CRCV), in a large kennelled dog population with a 
history of endemic respiratory disease, and we have shown that this virus is 
associated with CIRD. 

5 Some members of the family coronaviridae are known to cause respiratory 
disease in humans, cattle, swine and poultry (Makela et al, 1998, Pensaert 
et al, 1986, Ignjatovic and Sapats 2000). For example, bovine respiratory 
coronavirus is associated with shipping fever in cattle which is a 
multifactorial respiratory disease (Storz et al, 2000). 

10 However, coronaviruses were not suspected to have a role in the 
pathogenesis of CIRD. Indeed, with only a single exception, canine 
coronaviruses have been reported to be enteric viruses and to cause acute 
diarrhoea mainly in young dogs (for example, Tennant et al, 1993). In a 
large study of viruses involved in canine respiratory diseases, Binn et al 

15 (1979) reported the detection of a canine coronavirus in the lung of a single 
dog that was also infected with SV5 and canine adenovirus 2, two other 
viruses that are associated with canine respiratory disease. 

There are 30-40 dog vaccines commercially available in the UK for use 
against a number of pathogens that can cause a range of diseases, such as 
20 neurological, enteric, hepatic and respiratory diseases. Most of these 
vaccines contain microbial agents such as Distemper virus, Canine 
Adenovirus-2, Canine parvovirus, canine parainfluenza virus and 
Leptospira canicola and L. icterohaemorrhagiae. None of these vaccines 
contain canine coronaviruses. 

25 The dog vaccines for use against canine respiratory diseases are marketed as 
vaccines for "kennel-cough" (see below). All of the vaccines contain 
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Bordetella bronchisepticum, which is a bacterium associated with "kennel 
cough". 

Coyne M.J. & May S.W., (1995) in their article entitled "Considerations in 
using a canine coronavirus vaccine" (published as a Pfizer Technical 

5 Bulletin on the Internet at http://www.pfizer.com/ah/vet/tref/trbull/ 
ccv.html), lists over 20 commercially available vaccines against either 
canine coronaviruses alone or against canine coronaviruses together with 
other organisms. Each of these vaccines is for canine enteric disease, and 
there is no suggestion that a canine coronavirus may be associated with 

10 respiratory disease. 

US Patents Nos. 6,057,436 and 6,372,224, both to Miller et al and assigned 
to Pfizer, Inc., describe the spike gene of the enteric canine coronavirus and 
uses therefor, including its use as a vaccine against gastroenteritis. Neither 
of these two patents suggest that a canine coronavirus may be involved in 
15 CIRD. 

Members of the family coronaviridae are enveloped viruses, 80-160nm in 
diameter, containing a linear positive-stranded RNA genome. The structural 
proteins of coronaviruses are the spike glycoprotein (S), the membrane 
glycoprotein (M) and the nucleocapsid protein (N). The 

20 hemagglutinin/esterase glycoprotein (HE) is found only on the surface of 
group II coronaviruses (e.g. bovine coronavirus and murine hepatitis virus) 
(Spaan et al, 1988). Further details of the structure of coronoviruses may be 
found in the chapter by Cavanagh et al entitled "Coronviridae" p407-41 1, in 
"Virus Taxonomy, 6 th Report of the International Committee on Taxonomy 

25 of Viruses", pub. Springer-Verlag Wein, New York, Eds. Murphy et al, 
which is incorporated herein by reference. 
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The canine respiratory coronavirus (CRCV) of the invention may be 
characterised as a coronavirus present in the respiratory tracts of dogs with 
infectious respiratory disease. To further characterise CRCV, we have 
determined the sequence of 250 nucleotide residues of the CRCV 
polymerase (pol) cDNA (Figure 1 and SEQ ID NO: 1) which corresponds to 
an 83 amino acid partial sequence of the pol protein (Figure 2 and SEQ ID 
NO: 2). We have also cloned and determined the sequence of the 4092 
nucleotide residues of the CRCV spike (S) cDNA (Figure 3 and SEQ ID 
NO: 3), corresponding to 1363 amino acids (Figure 4 and SEQ ID NO: 4). 
We have also determined the sequence of 497 nucleotide residues of the 
CRCV hemagglutinin/esterase (HE) gene (Figure 13 and SEQ ID NO: 21), 
corresponding to 165 amino acids (Figure 14 and SEQ ID NO: 22). We 
have identified that CRCV has a surprisingly low homology to the enteric 
canine coronavirus (CCV) while it has an unexpectedly high level of 
homology to bovine coronavirus (strain LY138 or Quebec) and human 
coronavirus (strain OC43). 

A culture of "Spike D-l CRCV", which is XL 1 -Blue E. coli (Stratagene) 
containing a pT7Blue2 plasmid (Novagen) whose insert contains a portion 
of the CRCV spike cDNA, has been deposited under the Budapest Treaty at 
NCIMB Ltd under Accession number NCIMB 41 146 on 25 July 2002. The 
depositor of NCIMB 41146 is the Royal Veterinary College, Royal College 
Street, London NW1 OTU, UK. The address of NCIMB Ltd is 23 St. 
Machar Drive, Aberdeen, Scotland, AB24 3RY, UK. 

The phylogenetic relationship of CRCV to eleven known coronaviruses was 
determined based upon a comparison of the 250 nucleotide sequence from 
the CRCV pol gene and the corresponding regions of the other viruses 
(Figure 5). The bovine coronavirus (BCV), human coronavirus (HCV) 
strain OC43 and hemagglutinating encephalomyelitis virus (HEV) were 
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found to be most closely related to CRCV, while the enteric CCV was 
found to be only distantly related to CRCV. 

Over the 250 sequenced residues of the pol cDNA, corresponding to 83 
amino acids, CRCV has only 68.5% and 75.9% sequence identity at the 
nucleotide and amino acid levels, respectively, with the equivalent region of 
the enteric CCV (strain 1-71) pol gene (Genbank Accession No. 
AF124986), as shown in Figure 6 and 7. 

Over the 4092 sequenced nucleotide residues of the CRCV S gene, 
corresponding to 1363 amino acids, CRCV has 45% and 21.2% sequence 
identity at the nucleotide (Figure 8) and amino acid levels, respectively, 
with the equivalent region of the enteric CCV (strain 1-71) S gene. 

Enteric CCV is not a group II coronavirus and does not possess an HE gene, 
hence it is not possible to determine the extent of sequence identity between 
this gene in CRCV and in enteric CCV. 

Except as described below, the percentage identity between two nucleotide 
or two amino acid sequences was determined using FASTA version 34 
(Pearson WR. (1990) "Rapid and sensitive sequence comparison with 
FASTP and FASTA". Methods Enzymol; 183:63-98). FASTA settings were 
Gap open penalty -16 and Gap extension penalty -4. 

The percentage identity between the CRCV and enteric CCV spike 
sequences was determined using GCG version 10 (Genetics Computer 
Group, (1991), Program Manual for the GCG Package, Version 7, April 
1991, 575 Science Drive, Madison, Wisconsin, USA 53711). The GCG 
parameters used were: Gap creation penalty 50, gap extension penalty 3 for 
DNA, and Gap creation penalty 8 and Gap extension penalty 2 for Protein. 
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Sequence alignments were performed using ClustalX (Thompson et al, 
1997). 

By contrast, over the 250 sequenced residues of the pol cDNA, CRCV has 
98.8% sequence identity with the equivalent region of the BCV strain 
5 Quebec pol gene (Genbank Accession No. AF220295), 98.4% sequence 
identity with the BCV strain LY138 pol gene (Genbank Accession No. 
AF124985) and 98.4% sequence identity with the HCV OC43 pol gene 
(Genbank Accession No. AF124989). 

There was only a single amino acid difference between the CRCV pol 
10 protein over the 83 sequenced amino acids and the BCV, HCV and HEV 
pol proteins which is that CRCV has E (Glu) as opposed to D (Asp) at' the 
position corresponding to position 4975 in the BCV genome (Accession No. 
SWALL: Q91A29). Thus the CRCV pol protein is 99% identical to the 
BCV, HCV and HEV pol proteins over this region. 

15 The one and three letter amino acid codes of the IUPAC-IUB Biochemical 
Nomenclature Commission are used herein. 

Over the 497 sequenced nucleotide residues, corresponding to 165 amino 
acids, of the HE gene, CRCV has 98.994% and 98.2% sequence identity 
with the equivalent region of the BCV strain LY138 HE gene (Genbank 

20 Accession No. AF058942) at the nucleotide and amino acid levels 
respectively. CRCV has 98.189% (nucleotide) and 98.2% (amino acid) 
sequence identity with human enteric coronavirus (HECV) HE gene 
(Genbank Accession No. L07747); 97.4% (nucleotide) and 95.2% (amino 
acid) sequence identity with the HCV OC43 HE gene (Genbank Accession 

25 No. M76373); and 92.0% (nucleotide) and 93.9% (amino acid) identity with 
HEV (Genbank Accession Nos. AF481863), as shown in Figures 15 and 16. 
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As shown in Figure 16 and Table 3, the three amino acids that are different 
between the CRCV HE protein and each of the BCV, HECV, HCV and 
HEV S proteins, within the 165 amino acids of the CRCV HE protein, are F 
(Phe) as opposed to L (Leu), N (Asn) as opposed to T (Thr), and L (Leu) as 
5 opposed to V (Val) at positions corresponding to position 235, 242 and 253, 
respectively, in the BCV, HECV, HCV OC43 and HEV HE genes (Figure 
16). Thus F at position 235, N at position 242 and L at position 253 could 
be said to be CRCV HE protein-specific amino acids. 

Over the 4092 sequenced nucleotide residues, corresponding to 1363 amino 
10 acids, of the CRCV S gene, CRCV has 97.3% and 96% identity with the 
equivalent region of BCV strain LY138 (Genbank Accession No. 
AF058942) at the nucleotide and amino acid levels respectively. CRCV has 
96.9% (nucleotide) and 95.2% (amino acid) identity with HCV strain OC43 
(Genbank Accession No. Z32768), and 83.8% (nucleotide) and 80.4% 
15 (amino acid) identity with HEV (Genbank Accession Nos. AF481863 
(cDNA) and AAM 77000 (protein)) as shown in Figures 9 and 10. 

The amino acids that are different between the CRCV S protein and each of 
the BCV, HCV and HEV S proteins, within the 1363 amino acids of the 
CRCV S protein, are listed in Table 1 below. Thus the amino acids listed in 
20 Table 1 could be said to be CRCV S protein-specific amino acids. The 
amino acids are numbered from the initial M residue at the start of the 
CRCV protein, as shown in Figure 4. 
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Table 1; List of 39 amino acids specific to the CRCV S protein that are 
not present in the BCV, HCV and HEV S proteins. 



Position 


Amino acid 




Position 


Amino acid 


103 


V 




692 


G 


118 


V 




695 


S 


166 


D 




757 


w 


171 


M 




758 


G 


179 


K 




763 


Q 


192 


P 




769 


T 


210 


S 




786 


P 


235 


H 




792 


H 


267 


F 




818 


R 


388 


F 




827 


P 


407 


M 




828 


V 


436 


S 




887 


F 


440 


I 




933 


D 


447 


I 




977 


F 


501 


F 




1011 


T 


525 


Y 




1018 


S 


528 


N 




1063 


K 


540 


L 




1256 


L 


582 


K 




1257 


M 


608 


G 
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A first aspect of the invention provides a coronavirus S protein, or fragment 
thereof, having at least 75% amino acid sequence identity with the CRCV S 
protein whose amino acid sequence is listed in Figure 4, and having at least 
one of V at position 103; V at position 1 18; D at position 166; M at position 
171; K at position 179; P at position 192; S at position 210; H at position 
235; F at position 267; F at position 388; M at position 407; S at position 
436; I at position 440; I at position 447; F at position 501; Y at position 525; 
N at position 528; L at position 540; K at position 582; G at position 608; G 
at position 692; S at position 695; W at position 757; G at position 758; Q at 
position 763; T at position 769; P at position 786; H at position 792; R at 
position 818; P at position 827; V at position 828; F at position 887; D at 
position 933; F at position 977; T at position 1011; S at position 1018; K at 
position 1063; L at position 1256; and M at position 1257. The amino acids 
are numbered from the initial M at the start of the CRCV S protein, as listed 
in Figure 4 (SEQ ID NO: 4). 

It is appreciated that the partial nucleotide sequence of CRCV S can be 
readily determined by a person or ordinary skill in the art by sequencing the 
insert of the plasmid contained in E. coli strain D-l CRCV, that has been 
deposited under the Budapest Treaty at NCIMB Ltd. under Accession 
number NCIMB 41146 on 25 July 2002. Furthermore, this DNA can be 
used as a hybridisation probe, or as the basis for the design of probes, in the 
isolation of CRCV nucleic acid in dogs. 

For the avoidance of doubt, the invention includes a coronavirus S protein, 
or fragment thereof, having at least 75% amino acid sequence identity with 
the CRCV S protein (SEQ ID NO: 4), and comprising at least one of the 
amino acids specific for the CRCV S protein at the position listed in Table 
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By "protein" we also include the meaning glycoprotein. The amino acid 
sequence of a glycoprotein refers to the amino acid sequence of the 
polypeptide backbone of the glycoprotein, irrespective of the type, number, 
sequence and position of the sugars attached thereto. 

5 Typically, the invention includes an isolated or recombinant protein, and not 
an unmodified CRCV protein present as a CRCV component. 

The invention includes a coronavirus S protein, or fragment thereof, having 
at least 76% amino acid sequence identity with the CRCV S protein (SEQ 
ID NO: 4), or at least 77%, or at least 78%, or at least 79%, or at least 80%, 

10 or at least 81%, or at least 82%, or at least 83%, or at least 84%, or at least 
85%, or at least 86%, or at least 87%, or at least 88%, or at least 89%, or at 
least 90%, or at least 91%, or at least 92%, or at least 93%, or at least 94%, 
or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 
99% amino acid sequence identity with the CRCV S protein, and 

15 comprising at least one of the amino acids specific for the CRCV S protein 
at the position listed in Table 1 . 

The invention also includes a coronavirus S protein, or fragment thereof, 
having at least 75%, or at least 80%, or at least 85% or at least 90% or at 
least 95% amino acid sequence identity with the CRCV S protein (SEQ ID 

20 NO: 4), and comprising at least 2, or at least 3, or at least 4, or at least 5, or 
at least 6, or at least 7, or at least 8, or at least 9, or at least 10, or at least 11, 
or at least 12, or at least 13, or at least 14, or at least 15, or at least 16, or at 
least 17, or at least 18, or at least 19, or at least 20, or at least 21, or at least 
22, or at least 23, or at least 24, or at least 25, or at least 26, or at least 27, or 

25 at least 28, or at least 29, or at least 30, or at least 31, or at least 32, or at 
least 33, or at least 34, or at least 35, or at least 36, or at least 37, or at least 
38 of the amino acids specific for CRCV S protein at the positions listed in 
Table 1. 
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Preferably, the coronavirus S protein, or fragment thereof comprises all 39 
of the amino acid residues specific for CRCV S protein at the positions 
listed in Table 1. 

Thus the invention includes a BCV, HCV or HEV S protein or fragment 
thereof, that has been modified at at least one position listed in Table 1 to 
resemble the CRCV S protein. 

Preferably, the coronavirus S protein of the invention is a CRCV S protein 
that comprises or consists of the sequence listed in Figure 4 (SEQ ID NO: 
4), or a variant thereof with at least 97% identity with the sequence listed in 
Figure 4. Preferably, the variant has at least 98%, or at least 99% amino 
acid sequence identity with the sequence listed in Figure 4. More 
preferably the variant has at least 99.1%, or at least 99.2%, or at least 
99.3%, or at least 99.4%, or at least 99.5%, or at least 99.6%, or at least 
99.7%, or at least 99.8%, or at least 99.9% amino acid sequence identity 
with the sequence listed in Figure 4. 

Thus the variant of the coronavirus S protein of the invention includes a 
protein that comprises or consists of the sequence listed in Figure 4 (SEQ 
ID NO: 4) but has between 1 and 40 amino acid differences from the 
sequence listed in Figure 4. Preferably, the variant has less than 40 amino 
acid differences from the sequence listed in Figure 4. More preferably the 
variant has less than 35, less than 30, or less than 25, or less than 20, or less 
than 15, or 10 or 9 or 8 or 7 or 6 or 5 or 4 or 3 or 2 amino acid differences, 
or a single amino acid difference, from the sequence listed in Figure 4. 

The invention also includes a CRCV S protein fragment comprising a 
fragment of the sequence listed in Figure 4 (SEQ ID NO: 4) which 
comprises at least one of the amino acids specific for CRCV S protein at the 
position listed in Table 1. 
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The invention includes a coronavirus S protein, or fragment thereof, having 
at least 75% amino acid sequence identity with BCV strain LY138 S protein 
(SEQ ID NO: 14, Genbank Accession No. AF058942), and comprising at 
least one of V at position 103; V at position 118; D at position 166; M at 

5 position 171; K at position 179; P at position 192; S at position 210; H at 
position 235; F at position 267; F at position 388; M at position 407; S at 
position 436; I at position 440; I at position 447; ; F at position 501; Y at 
position 525; N at position 528; L at position 540; K at position 582; G at 
position 608; G at position 692; S at position 695; W at position 757; G at 

10 position 758; Q at position 763; T at position 769jJB at position 786; H at 
position 792; R at position 818; P at position 827; V at position 828; F at 
position 887; D at position 933; F at position 977; T at position 1011; S at 
position 1018; K at position 1063; L at position 1256 and M at position 
1257. 

15 For the avoidance of doubt, the invention includes a coronavirus S protein, 
or fragment thereof, having at least 75% amino acid sequence identity with 
BCV strain LY138 S protein (SEQ ID NO: 14), and comprising at least one 
of the amino acids specific for CRCV S protein at the position listed in 
Table 1. 

20 The invention includes a coronavirus S protein, or fragment thereof, having 
at least 76% amino acid sequence identity with BCV strain LY138 S 
protein, or at least 77%, or at least 78%, or at least 79%, or at least 80%, or 
at least 81%, or at least 82%, or at least 83%, or at least 84%, or at least 
85%, or at least 86%, or at least 87%, or at least 88%, or at least 89%, or at 

25 least 90%, or at least 91%, or at least 92%, or at least 93%, or at least 94%, 
or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 
99% amino acid sequence identity with BCV strain LY138 S protein, and 
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having at least one of the amino acids specific for CRCV S protein at the 
position listed in Table 1. 

The invention also includes a coronavirus S protein, or fragment thereof, 
having at least 75%, or at least 80%, or at least 85%, or at least 90%, or at 
least 95% amino acid sequence identity with BCV strain LY138 S protein 
(SEQ ID NO: 14), and comprising at least 2, or at least 3, or at least 4, or at 
least 5, or at least 6, or at least 7, or at least 8, or at least 9, or at least 10, or 
at least 11, or at least 12, or at least 13, or at least 14, or at least 15, or at 
least 16, or at least 17, or at least 18, or at least 19, or at least 20, or at least 
21, or at least 22, or at least 23, or at least 24, or at least 25, or at least 26, or 
at least 27, or at least 28, or at least 29, or at least 30, or at least 31, or at 
least 32, or at least 33, or at least 34, or at least 35, or at least 36, or at least 
37, or at least 38 of the amino acids specific for CRCV S protein at the 
positions listed in Table 1 . 

Preferably, the coronavirus S protein, or fragment thereof comprises all 39 
of the amino acid residues specific for CRCV S protein at the positions 
listed in Table 1. 

A second aspect of the invention provides a coronavirus pol protein, or 
fragment thereof, having at least 90% amino acid sequence identity with the 
BCV pol protein (SEQ ID NO: 5) and comprising the amino acid E at the 
position corresponding to position 4975 in the BCV genome (Accession No. 
SWALL: Q9 1A29). 

The invention includes a coronavirus pol protein, or fragment thereof, 
having at least 91% amino acid sequence identity with BCV strain LY138 
pol protein, or at least 92%, or at least 93%, or at least 94%, or at least 95%, 
or at least 96%, or at least 97%, or at least 98%, or at least 99% amino acid 
sequence identity with BCV strain LY138 pol protein, and having the amino 
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acid E at the position corresponding to position 4975 in the BCV genome 
(Accession No. SWALL: Q91A29). 

Preferably, the coronavirus pol protein, or fragment thereof is a CRCV pol 
protein or fragment thereof that comprises or consists of the amino acid 
sequence listed in Figure 2. 

Thus the invention includes a BCV, HCV or HEV pol protein or fragment 
thereof, that has been modified at the amino acid corresponding to position 
4975 in the BCV genome, to resemble the CRCV pol protein. 

The invention also includes a CRCV pol protein fragment comprising a 
fragment of the sequence listed in Figure 2 (SEQ ID NO: 2) and having the 
amino acid E at the position corresponding to position 4975 in the BCV 
genome. 

A third aspect of the invention provides a coronavirus HE protein, or 
fragment thereof, having at least 90% amino acid sequence identity with the 
BCV LY138 HE protein (Genbank Accession No. AF058942), and having 
at least one of F at position 235; N at position 242; and L at position 253. 
The amino acid positions are numbered from the initial M (which is number 
1) at the start of the BCV HE protein. 

The invention includes a coronavirus HE protein, or fragment thereof, 
having at least 91% amino acid sequence identity with BCV strain LY138 
HE protein, or at least 92%, or at least 93%, or at least 94%, or at least 95%, 
or at least 96%, or at least 97%, or at least 98%, or at least 99% amino acid 
sequence identity with BCV strain LY138 HE protein, and having at least 
one of F at position 235; N at position 242; and L at position 253. The 
amino acid positions are numbered from the initial M (which is number 1) 
at the start of the BCV HE protein. 
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The invention also includes a coronavirus HE protein, or fragment thereof, 
having at least 90%, or at least 91%, or at least 92%, or at least 93%, or at 
least 94%, or at least 95%, or at least 96%, or at least 97%, or at least 98%, 
or at least 99% amino acid sequence identity with BCV strain LY138 HE 
5 protein, and having two of F at position 235; N at position 242; and L at 
position 253. The amino acid positions are numbered from the initial M 
(which is number 1) at the start of the BCV HE protein. 

The invention further includes a coronavirus HE protein, or fragment 
thereof, having at least 90%, or at least 91%, or at least 92%, or at least 
10 93%, or at least 94%, or at least 95%, or at least 96%, or at least 97%, or at 
least 98%o, or at least 99% amino acid sequence identity with BCV strain 
LY138 HE protein, and having all three of F at position 235; N at position 
242; and L at position 253. The amino acid positions are numbered from 
the initial M (which is number 1) at the start of the BCV HE protein. 

15 Preferably, the coronavirus HE protein, or fragment thereof is a CRCV HE 
protein or fragment thereof that comprises or consists of the amino acid 
sequence listed in Figure 14 (SEQ ID NO: 22). 

Thus the invention includes a BCV, HCV, HECV or HEV HE protein or 
fragment thereof, that has been modified at one or more of the amino acids 
20 corresponding to position 235, 242; and 253 to resemble the CRCV HE 
protein. 

The invention also includes a CRCV HE protein fragment comprising a 
fragment of the sequence listed in Figure 14 (SEQ ID NO: 22) and having 
one or more of the amino acid F at position 235, N at position 242, and L at 
25 position 253. The numbering of these amino acid positions corresponds to 
that of BCV LY138 HE protein (Genbank Accession No. AF058942) in 
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which residue number 1 is the initial M at the start of the BCV LY138 HE 
protein. 

The coronavirus S, pol and HE proteins as defined above in the first, second 
and third aspects of the invention may be termed herein "CRCV" or 
"CRCV-like" proteins. 

A "CRCV S protein" is an S protein or fragment thereof that has the native 
CRCV S amino acid sequence as listed in Figure 4 (SEQ ID NO: 4), or a 
fragment thereof which comprises at least one of the amino acids specific 
for a CRCV S protein at the positions listed in Table 1. 

A "CRCV pol protein" is a pol protein or fragment thereof that has the 
native CRCV pol amino acid sequence as listed in Figure 2 (SEQ ID NO: 
2), or a fragment thereof which comprises the amino acid E at the position 
corresponding to position 4975 in the BCV genome. 

A "CRCV HE protein" is an HE protein or fragment thereof that has the 
native CRCV HE amino acid sequence as listed in Figure 14 (SEQ ID NO: 
22), or a fragment thereof which comprises one or more of the amino acid F 
at position 235, N at position 242, and L at position 253. The numbering of 
these amino acid positions corresponds to that of BCV LY138 HE protein 
(Genbank Accession No. AF058942) in which residue number 1 is the 
initial M at the start of the BCV LY138 HE protein. 

A "CRCV-like S protein" is an S protein or fragment thereof that does not 
have an amino acid sequence identical to the native CRCV S amino acid 
sequence (Figure 4 and SEQ ID NO: 4), but has at least 75% sequence 
identity with the corresponding region of the CRCV or BCV strain LY138 S 
protein, and has at least one of the amino acids specific for a CRCV S 
protein at the positions listed in Table 1 . 
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A "CRCV-like S protein" also includes an S protein that does not have an 
amino acid sequence identical to the native CRCV S amino acid sequence 
(Figure 4 and SEQ ID NO: 4), but that comprises or consists of a variant of 
the sequence listed in Figure 4 with at least 97% identity with the sequence 

5 listed in Figure 4. Preferably, the variant has at least 98%, or at least 99% 
amino acid sequence identity with the sequence listed in Figure 4. More 
preferably the variant has at least 99.1%, or at least 99.2%, or at least 
99.3%), or at least 99.4%, or at least 99.5%, or at least 99.6%, or at least 
99.7%o, or at least 99.8%, or at least 99.9%> amino acid sequence identity 

10 with the sequence listed in Figure 4. 

0 

A "CRCV-like pol protein" is a pol protein or fragment thereof that does 
not have an amino acid sequence identical to the native CRCV pol amino 
acid sequence, but has at least 90% sequence identity with the 
corresponding BCV strain LY138 pol protein, and which has an E at the 
15 position corresponding to position 4975 in the BCV genome. 

A "CRCV-like HE protein" is an HE protein or fragment thereof that does 
not have an amino acid sequence identical to the native CRCV HE amino 
acid sequence, but has at least 90%> sequence identity with the 
corresponding BCV strain LY138 HE protein, and which has one or more 
20 of the amino acid F at position 235, N at position 242, and L at position 
253. The numbering of these three amino acid positions corresponds to that 
of BCV LY138 HE protein (Genbank Accession No. AF058942) in which 
residue number 1 is the initial M at the start of the BCV LY138 HE protein. 

Preferably, the CRCV or CRCV-like protein, or fragment thereof, is at least 
25 10 amino acids in length. More preferably, the CRCV or CRCV-like 
protein, or fragment thereof, is at least 20, or at least 30, or at least 40, or at 
least 50, or at least 100, or at least 200, or at least 300, or at least 400, or at 



WO 2004/011651 



PCT/GB2003/002832 



18 

least 500, or at least 600, or at least 700, or at least 800, or at least 900, or at 
least 1,000, or at least 1,100, or at least 1,200 amino acids in length. 

Preferably, the CRCV or CRCV-like protein, or fragment thereof, is less 
than about 1,300 amino acids in length. More preferably, the CRCV or 
CRCV-like protein, or fragment thereof, is less than about 1,200, or less 
than about 1,100, or less than about 1,000, or less than about 900, or less 
than about 800, or less than about 700, or less than about 600, or less than 
about 500, or less than about 400, or less than about 300, or less than about 
200, or less than about 100, or less than about 50 amino acids in length. 

CRCV proteins may be isolated from CRCV, or may be made using protein 
chemistry techniques for example using partial proteolysis of isolated 
proteins (either exolytically or endolytically), or by de novo synthesis. 
Alternatively, the CRCV proteins, as well as CRCV-like proteins, may be 
made by recombinant DNA technology. Suitable techniques for cloning, 
manipulation, modification and expression of nucleic acids, and 
purification of expressed proteins, are well known in the art and are 
described for example in Sambrook et al (2001) "Molecular Cloning, a 
Laboratory Manual" , 3 rd edition, Sambrook et al (eds), Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, USA, incorporated herein by 
reference. 

Shorter fragments of CRCV and CRCV-like proteins, ie peptides, may be 
synthesised using standard techniques. Peptides may be synthesised by the 
Fmoc-polyamide mode of solid-phase peptide synthesis as disclosed by Lu et 
al (1981) J. Org. Chem. 46, 3433 and references therein. Temporary N-amino 
group protection is afforded by the 9-fluorenylmethyloxycarbonyl (Fmoc) 
group. Repetitive cleavage of this highly base-labile protecting group is 
effected using 20% piperidine in N,N-dimethylformamide. Side-chain 
functionalities may be protected as their butyl ethers (in the case of serine 
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threonine and tyrosine), butyl esters (in the case of glutamic acid and aspartic 
acid), butyloxycarbonyl derivative (in the case of lysine and histidine), trityl 
derivative (in the case of cysteine) and 4-methoxy-2,3,6- 
trimethylbenzenesulphonyl derivative (in the case of arginine). Where 
glutamine or asparagine are C-terminal residues, use is made of the 4,4'- 
dimethoxybenzhydryl group for protection of the side chain amido 
functionalities. The solid-phase support is based on a polydimethyl- 
acrylamide polymer constituted from the three monomers dimethylacrylamide 
(backbone-monomer), bisacryloylethylene diamine (cross linker) and 
acryloylsarcosine methyl ester (functionalising agent). The peptide-to-resin 
cleavable linked agent used is the acid-labile 4-hydroxymethyl-phenoxyacetic 
acid derivative. All amino acid derivatives are added as their preformed 
symmetrical anhydride derivatives with the exception of asparagine and 
glutamine, which are added using a reversed N,N-dicyclohexyl- 
carbodiimide/l-hydroxybenzotriazole mediated coupling procedure. All 
coupling and deprotection reactions are monitored using ninhydrin, 
trinitrobenzene sulphonic acid or isotin test procedures. Upon completion of 
synthesis, peptides are cleaved from the resin support with concomitant 
removal of side-chain protecting groups by treatment with 95% trifluoroacetic 
acid containing a 50% scavenger mix. Scavengers commonly used are 
ethanedithiol, phenol, anisole and water, the exact choice depending on the 
constituent amino acids of the peptide being synthesised. Trifluoroacetic acid 
is removed by evaporation in vacuo, with subsequent trituration with diethyl 
ether affording the crude peptide. Any scavengers present are removed by a 
simple extraction procedure which on lyophilisation of the aqueous phase 
affords the crude peptide free of scavengers. Reagents for peptide synthesis 
are generally available from Calbiochem-Novabiochem (UK) Ltd, 
Nottingham NG7 2QJ, UK. Purification may be effected by any one, or a 
combination of, techniques such as size exclusion chromatography, ion- 
exchange chromatography and (principally) reverse-phase high performance 
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liquid chromatography. Analysis of peptides may be carried out using thin 
layer chromatography, reverse-phase high performance liquid 
chromatography, amino-acid analysis after acid hydrolysis and by fast atom 
bombardment (FAB) mass spectrometric analysis. 

A fourth aspect of the invention provides a polynucleotide that encodes a 
CRCV or CRCV-like S, pol or HE protein according to the first, second and 
third aspects of the invention, or the complement thereof. 

Preferably, the polynucleotide encodes a CRCV S protein according to the 
first aspect of the invention, or the complement thereof 

More preferably, the polynucleotide encoding the CRCV S protein 
comprises or consists of the sequence listed in Figure 3 (SEQ ID NO: 3). 

It is appreciated that the sequence listed in Figure 3 (SEQ ID NO: 3) 
contains a Y at position 3531, which refers to either C or T. In both cases 
the corresponding amino acid is He. Thus the invention includes a 
polynucleotide encoding a CRCV S protein which comprises or consists of 
the sequence listed in Figure 3, and having C at position 3531. The 
invention also includes a polynucleotide encoding a CRCV S protein which 
comprises or consists of the sequence listed in Figure 3, and having T at 
position 3531. 

The invention also includes a CRCV S polynucleotide comprising a 
fragment of the sequence listed in Figure 3 (SEQ ID NO: 3), that encodes a 
protein having at least one of the amino acids specific for CRCV S protein 
at the position listed in Table 1, or the complement thereof. 

Preferably, the polynucleotide encoding the pol protein comprises or 
consists of the sequence listed in Figure 1 (SEQ ID NO: 1), or the 
complement thereof. 
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The invention also includes a CRCV pol polynucleotide comprising a 
fragment of the sequence listed in Figure 1 (SEQ ID NO: 1) that encodes a 
protein having E at the position corresponding to position 4975 in the BCV 
genome, or the complement thereof. 

Preferably, the polynucleotide encoding the HE protein comprises or 
consists of the sequence listed in Figure 13 (SEQ ID NO: 21), or the 
complement thereof. 

The invention also includes a CRCV HE polynucleotide comprising a 
fragment of the sequence listed in Figure 13 (SEQ ID NO: 21) that encodes 
a protein having one or more of the amino acid F at position 235, N at 
position 242, and L at position 253. The numbering of these three amino 
acid positions corresponds to that of BCV LY138 HE protein (Genbank 
Accession No. AF058942) in which residue number 1 is the initial M at the 
start of the BCV LY1 3 8 HE protein. 

The polynucleotides as defined above are referred to herein as CRCV or 
CRCV-like polynucleotides of the invention. 

A "CRCV-like polynucleotide" is a polynucleotide that does not have a 
base sequence identical to all or a fragment of the native CRCV cDNA 
sequence as listed in Figures 1, 3 and 13 (SEQ ID NOS: 1, 3 and 21), but 
that encodes a CRCV or CRCV-like S pol or HE protein as defined above, 
or the complement thereof 

The CRCV is a positive strand RNA virus. The polynucleotide of the 
invention may be DNA or RNA. The RNA may be positive or negative 
strand RNA. The DNA may be single or double stranded DNA. 

Suitable techniques for cloning and sequencing a cDNA from a positive 
strand RNA virus such as CRCV are well known in the art and are 
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described for example in Sambrook et al 2001, incorporated herein by 
reference. 

The CRCV or CRCV-like polynucleotides of the invention may be any 
suitable size. However, for certain purposes, such as probing or amplifying, 
it is preferred if the nucleic acid has fewer than 3,000, more preferably 
fewer than 1000, more preferably still from 10 to 100, and in further 
preference from 15 to 30 base pairs (if the nucleic acid is double-stranded) 
or bases (if the nucleic acid is single stranded). As is described more fully 
below, single-stranded DNA oligonucleotides, suitable for use as 
hybridisation probes or as primers in a polymerase chain reaction, are 
particularly preferred. 

Oligonucleotides that can specifically amplify, or hybridise to CRCV S, pol 
or HE polynucleotides, as opposed to BCV, HCV, HEV or enteric CCV S, 
pol or HE polynucleotides, are particularly preferred. Suitable 
oligonucleotides can be determined by a person of skill in the art by 
reference to the nucleotide sequence comparisons in Figures 6, 8, 9 and 15. 

It is appreciated that the CRCV or CRCV-like oligonucleotides may, even 
under highly stringent conditions, hybridise to nucleic acid, whether RNA 
or DNA, from HCV, BCV, and HEV as well as from CRCV. However, it is 
preferred if the CRCV or CRCV-like oligonucleotides hybridise to nucleic 
acid from CRCV under more stringent conditions than to nucleic acid from 
HCV, BCV or HEV. This can either be determined experimentally or by a 
comparison of the oligonucleotide sequence with the respective CRCV, 
HCV, BCV and HEV sequences, as is well known to one of skill in the art 
(Sambrook et al 2001). 

It is also appreciated that the CRCV or CRCV-like oligonucleotides may 
hybridise to nucleic acid, whether RNA or DNA, from the enteric CCV as 
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well as from CRCV. However, it is preferred if the CRCV or CRCV-like 
oligonucleotides hybridise to nucleic acid from CRCV under more stringent 
conditions than to nucleic acid from enteric CCV. This can either be 
determined experimentally or by a comparison of the oligonucleotide 
5 sequence with the respective sequences, as is well known to one of skill in 
the art (Sambrook et al 2001). Preferably, the oligonucleotides do not 
hybridise to nucleic acid from enteric CCV at all under stringent conditions 
(see below). 

Conveniently, the CRCV or CRCV-like polynucleotides or oligonucleotides 
io further comprise a detectable label. 

By "detectable label" is included any convenient radioactive label such as 
32 P, 33 P or 35 S which can readily be incorporated into a nucleic acid 
molecule using well known methods; any convenient fluorescent or 
chemiluminescent label which can readily be incorporated into a nucleic 

15 acid is also included. In addition the term "detectable label" also includes a 
moiety which can be detected by virtue of binding to another moiety (such 
as biotin which can be detected by binding to streptavidin); and a moiety, 
such as an enzyme, which can be detected by virtue of its ability to convert 
a colourless compound into a coloured compound, or vice versa (for 

20 example, alkaline phosphatase can convert colourless o- 
nitrophenylphosphate into coloured o-nitrophenol). Conveniently, the 
nucleic acid probe may occupy a certain position in a fixed array and 
whether a nucleic acid hybridises to it can be determined by reference to the 
position of hybridisation in the fixed array. 

25 Labelling with [ 32 P]dCTP may be carried out using a Rediprime® random 
primer labelling kit supplied by Amersham. 
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Primers which are suitable for use in a polymerase chain reaction (PGR; 
Saiki et al (1988) Science 239, 487-491) are preferred. Suitable PGR 
primers may have the following properties: 

It is well known that the sequence at the 5' end of the oligonucleotide need 
not match the target sequence to be amplified. 

It is usual that the PCR primers do not contain any complementary 
structures with each other longer than 2 bases, especially at their 3' ends, as 
this feature may promote the formation of an artefactual product called 
"primer dimer". When the 3' ends of the two primers hybridise, they form a 
"primed template" complex, and primer extension results in a short duplex 
product called "primer dimer". 

Internal secondary structure should be avoided in primers. For symmetric 
PCR, a 40-60% G+C content is often recommended for both primers, with 
no long stretches of any one base. The classical melting temperature 
calculations used in conjunction with DNA probe hybridization studies 
often predict that a given primer should anneal at a specific temperature or 
that the 72°C extension temperature will dissociate the primer/template 
hybrid prematurely. In practice, the hybrids are more effective in the PCR 
process than generally predicted by simple T m calculations. 

Optimum annealing temperatures may be determined empirically and may 
be higher than predicted. Taq DNA polymerase does have activity in the 
37-5 5°C region, so primer extension will occur during the annealing step 
and the hybrid will be stabilised. The concentrations of the primers are 
equal in conventional (symmetric) PCR and, typically, within 0.1- to ljaM 
range. 
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It will further be appreciated that if a control amplification reaction is to be 
carried out, for example using primers complementary to an ubiquitously 
expressed gene, that it may be beneficial for the products of the control and 
CRCV or CRCV-like products to be of different sizes, such that the two 

5 products may be distinguished by the detection means employed, for 
example by mobility on agarose gel electrophoresis. However, it may be 
desirable for the two products to be of similar size, for example both 
between 100 and 1000, or between 100 and 600 nucleotides long. This may 
aid simultaneous analysis of the products, for example by gel 

10 electrophoresis, and may also mean that the control and CRCV or CRCV- 
like amplification reactions may have similar performance characteristics, in 
terms, for example, of relative rates of accumulation of product at different 
stages during the reaction. 

Any of the nucleic acid amplification protocols can be used in the method 
15 of the invention including the polymerase chain reaction, QB replicase and 
ligase chain reaction. Also, NASBA (nucleic acid sequence based 
amplification), also called 3SR, can be used as described in Compton 
(1991) Nature 350, 91-92 and AIDS (1993), Vol 7 (Suppl 2), S108 or SDA 
(strand displacement amplification) can be used as described in Walker et al 
20 (1992) Nucl Acids Res, 20, 1691-1696. The polymerase chain reaction is 
particularly preferred because of its simplicity. 

When a pair of suitable nucleic acids of the invention are used in a PGR it is 
convenient to detect the product by gel electrophoresis and ethidium 
bromide staining. As an alternative, it is convenient to use a labelled 
25 oligonucleotide capable of hybridising to the amplified DNA as a probe. 
When the amplification is by PGR the oligonucleotide probe hybridises to 
the interprimer sequence as defined by the two primers. The 
oligonucleotide probe is preferably between 10 and 50 nucleotides long, 
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more preferably between 15 and 30 nucleotides long. It may be longer than 
the amplified DNA or include one or both of the primers, but in this case, 
the hybridisation conditions should be such that the probe should not 
hybridise to the primers alone, but only to an amplified product that also 
contains interprimer sequence that is capable of hybridising to the probe. 

The probe may be labelled with a radionuclide such as 32 P, 33 P and 35 S using 
standard techniques, or may be labelled with a fluorescent dye. When the 
oligonucleotide probe is fluorescently labelled, the amplified DNA product 
may be detected in solution (see for example Balaguer et al (1991). 
"Quantification of DNA sequences obtained by polymerase chain reaction 
using a bioluminescence adsorbent" Anal. Biochem. 195, 105-110 and 
Dilesare et al (1993) "A high- sensitivity electrochemiluminescence-based 
detection system for automated PCR product quantitation" BioTechniques 
15, 152-157. 

PCR products can also be detected using a probe which may have a 
fluorophore-quencher pair or may be attached to a solid support or may 
have a biotin tag or they may be detected using a combination of a capture 
probe and a detector probe. 

Fluorophore-quencher pairs are particularly suited to quantitative 
measurements of PCR reactions (eg RT-PCR). Fluorescence polarisation 
using a suitable probe may also be used to detect PCR products. 

The invention also includes a vector comprising the CRCV or CRCV-like 
polynucleotide of the fourth aspect of the invention. 

Typical prokaryotic vector plasmids are: pUC18, pUC19, pBR322 and 
pBR329 available from Biorad Laboratories (Richmond, CA, USA); 
p7rc99A, pKK223~3, pKK233-3, pDR540 and pRIT5 available from 
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Pharmacia (Piscataway, NJ, USA); pBS vectors, Phagescript vectors, 
Bluescript vectors, pNH8A, pNH16A, pNH18A, pNH46A available from 
Stratagene Cloning Systems (La Jolla, CA 92037, USA). 

A typical mammalian cell vector plasmid is pSVL available from Pharmacia 
5 (Piscataway, NJ, USA). This vector uses the SV40 late promoter to drive 
expression of cloned genes, the highest level of expression being found in T 
antigen-producing cells, such as COS-1 cells. An example of an inducible 
mammalian expression vector is pMSG, also available from Pharmacia 
(Piscataway, NJ, USA). This vector uses the glucocorticoid-inducible 
10 promoter of the mouse mammary tumour virus long terminal repeat to drive 
expression of the cloned gene. 

Useful yeast plasmid vectors are pRS403-406 and pRS413-416 and are 
generally available from Stratagene Cloning Systems (La Jolla, CA 92037, 
USA). Plasmids pRS403, pRS404, pRS405 and pRS406 are Yeast Integrating 
15 plasmids (Yips) and incorporate the yeast selectable markers HIS3, TRP1, 
LEU2 and URA3. Plasmids pRS413-416 are Yeast Centromere plasmids 
(YCps). 

Generally, the CRCV or CRCV-like polynucleotide of the invention is 
inserted into an expression vector, such as a plasmid, in proper orientation and 

20 correct reading frame for expression. It may be linked to the appropriate 
transcriptional and translational regulatory control nucleotide sequences 
recognised by the desired host prior to insertion into the vector, although such 
controls are generally available in the expression vector. Thus, the 
polynucleotide of the invention insert may be operatively linked to an 

25 appropriate promoter. Eukaryotic promoters include the CMV immediate 
early promoter, the HSV thymidine kinase promoter, the early and late SV40 
promoters and the promoters of retroviral LTRs. Other suitable promoters 
will be known to the skilled artisan. The expression constructs desirably also 



WO 2004/011651 



PCT/GB2003/002832 



28 

contain sites for transcription initiation and termination, and in the transcribed 
region, a ribosome binding site for translation (Hastings et al 9 International 
Patent No. WO 98/16643). 

Methods well known to those skilled in the art can be used to construct 
5 expression vectors containing the coding sequence and, for example 
appropriate transcriptional or translational controls. One such method 
involves ligation via homopolymer tails. Homopolymer polydA (or polydC) 
tails are added to exposed 3 ' OH groups on the DNA fragment to be cloned by 
terminal deoxynucleotidyl transferases. The fragment is then capable of 
10 annealing to the polydT (or polydG) tails added to the ends of a linearised 
plasmid vector. Gaps left following annealing can be filled by DNA 
polymerase and the free ends joined by DNA ligase. 

Another method involves ligation via cohesive ends. Compatible cohesive 
ends can be generated on the DNA fragment and vector by the action of 
15 suitable restriction enzymes. These ends will rapidly anneal through 
complementary base pairing and remaining nicks can be closed by the action 
of DNA ligase. 

A further method uses synthetic molecules called linkers and adaptors. DNA 
fragments with blunt ends are generated by bacteriophage T4 DNA 

20 polymerase or E.coli DNA polymerase I which remove protruding 3' termini 
and fill in recessed 3 5 ends. Synthetic linkers, pieces of blunt-ended double- 
stranded DNA which contain recognition sequences for defined restriction 
enzymes, can be ligated to blunt-ended DNA fragments by T4 DNA ligase. 
They are subsequently digested with appropriate restriction enzymes to create 

25 cohesive ends and ligated to an expression vector with compatible termini. 
Adaptors are also chemically synthesised DNA fragments which contain one 
blunt end used for ligation but which also possess one preformed cohesive 
end. 
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Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including International 
Biotechnologies Inc, New Haven, CN, USA. 

A desirable way to modify the polynucleotide of the invention is to use the 
5 polymerase chain reaction as disclosed by Saiki et al (1988) Science 239, 487- 
491. In this method the DNA to be enzymatically amplified is flanked by two 
specific oligonucleotide primers which themselves become incorporated into 
the amplified DNA. The specific primers may contain restriction 
endonuclease recognition sites which can be used for cloning into expression 
io vectors using methods known in the art. 

The invention also includes a host cell transformed with the vector 
comprising the CRCV or CRCV-like polynucleotide. The host cell can be 
either prokaryotic or eukaryotic. If the CRCV or CRCV-like 
polynucleotide, in the vector, is to be expressed as a glycoprotein, the host 
15 cell is a eukaryotic host cell, and preferably a mammalian host cell. 

Bacterial cells are preferred prokaryotic host cells and typically are a strain 
of E. coli such as, for example, the E. coli strains DH5 available from 
Bethesda Research Laboratories Inc., Bethesda, MD, USA, and RR1 
available from the American Type Culture Collection (ATCC) of Rockville, 

20 MD, USA (No ATCC 3 1343). Preferred eukaryotic host cells include yeast 
and mammalian cells, preferably vertebrate cells such as those from a 
mouse, rat, monkey or human fibroblastic cell line. Yeast host cells include 
YPH499, YPH500 and YPH501 which are generally available from 
Stratagene Cloning Systems, La Jolla, CA 92037, USA. Preferred 

25 mammalian host cells include Chinese hamster ovary (CHO) cells available 
from the ATCC as CCL61, NIH Swiss mouse embryo cells NIH/3T3 
available from the ATCC as CRL 1658, and monkey kidney-derived COS-1 
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cells available from the ATCC as CRL 1650. Preferred insect cells are Sf9 
cells which can be transfected with baculovirus expression vectors. 

Transformation of appropriate cell hosts with a vector is accomplished by 
well known methods that typically depend on the type of vector used. With 
regard to transformation of prokaryotic host cells, see, for example, Cohen et 
al (1972) Proc. Natl. Acad. Sci. USA 69, 2110 and Sambrook et al (2001)- 
Molecular Cloning, A Laboratory Manual, 3 rd Ed. Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY. Transformation of yeast cells is 
described in Sherman et al (1986) Methods In Yeast Genetics, A Laboratory 
Manual, Cold Spring Harbor, NY. The method of Beggs (1978) Nature 275, 
104-109 is also useful. With regard to vertebrate cells, reagents useful in 
transfecting such cells, for example calcium phosphate and DEAE-dextran or 
liposome formulations, are available from Stratagene Cloning Systems, or 
Life Technologies Inc., Gaithersburg, MD 20877, USA. 

Electroporation is also useful for transforming cells and is well known in the 
art for transforming yeast cell, bacterial cells and vertebrate cells. 

For example, many bacterial species may be transformed by the methods 
described in Luchansky et al (1988) Mol. Microbiol. 2, 637-646 incorporated 
herein by reference. The greatest number of transformants is consistently 
recovered following electroporation of the DNA-cell mixture suspended in 
2.5x PEB using 6250V per cm at 25uFD. 

Methods for transformation of yeast by electroporation are disclosed in 
Becker & Guarente (1990) Methods Enzymol. 194, 182. 

Physical methods may be used for introducing DNA into animal and plant 
cells. For example, microinjection uses a very fine pipette to inject DNA 
molecules directly into the nucleus of the cells to be transformed. Another 
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example involves bombardment of the cells with high-velocity 
microprojectiles, usually particles of gold or tungsten that have been coated 
with DNA 

Successfully transformed cells, ie cells that contain a CRCV or CRCV-like 
5 DNA construct, can be identified by well known techniques. For example, 
one selection technique involves incorporating into the expression vector a 
DNA sequence (marker) that codes for a selectable trait in the transformed 
cell. These markers include dihydrofolate reductase, G418 or neomycin 
resistance for eukaryotic cell culture, and tetracyclin, kanamycin or ampicillin 
10 resistance genes for culturing in E.coli and other bacteria. Alternatively, the 
gene for the selectable trait can be on another vector, which is used to co- 
transform the desired host cell. 

The marker gene can be used to identify transformants but it is desirable to 
determine which of the cells contain recombinant DNA molecules and which 
15 contain self-ligated vector molecules. This can be achieved by using a 
cloning vector where insertion of a DNA fragment destroys the integrity of 
one of the genes present on the molecule. Recombinants can therefore be 
identified because of loss of function of that gene. 

Another method of identifying successfully transformed cells involves 
20 growing the cells resulting from the introduction of an expression construct of 
the present invention to produce the CRCV or CRCV-like S, pol or HE 
protein. Cells can be harvested and lysed and their DNA content examined 
for the presence of the DNA using a method such as that described by 
Southern (1975) J. Mol Biol. 98, 503 or Berent et al (1985) Biotech. 3, 208. 
25 Alternatively, the presence of the protein in the supernatant can be detected 
using antibodies as described below. 
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In addition to directly assaying for the presence of recombinant DNA, 
successful transformation can be confirmed by well known immunological 
methods when the recombinant DNA is capable of directing the expression of 
the protein. For example, cells successfully transformed with an expression 
vector produce proteins displaying appropriate antigenicity. Samples of cells 
suspected of being transformed are harvested and assayed for the protein using 
suitable antibodies. 

Thus, in addition to the transformed host cells themselves, the present 
invention also contemplates a culture of those cells, preferably a monoclonal 
(clonally homogeneous) culture, or a culture derived from a monoclonal 
culture, in a nutrient medium. 

Host cells that have been transformed by the recombinant CRCV or CRCV- 
like polynucleotide, typically in a vector as described above, are then cultured 
for a sufficient time and under appropriate conditions known to those skilled 
in the art in view of the teachings disclosed herein to permit the expression of 
the CRCV or CRCV-like protein encoded by the CRCV or CRCV-like 
polynucleotide, which can then be recovered. 

The CRCV or CRCV-like protein can be recovered and purified from 
recombinant cell cultures by well-known methods including ammonium 
sulphate or ethanol precipitation, acid extraction, anion or cation exchange 
chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography 
and lectin chromatography. Most preferably, high performance liquid 
chromatography ("HPLC") is employed for purification. 

For example, for expression in a baculovirus system, recombinant DNA 
encoding the CRCV spike gene may be cloned into a suitable transfer vector 
such as pMelBac (Invitrogen). Co-transfection with baculovirus DNA (eg 
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Bac-N-Blue/Invitrogen) results in a recombinant baculovirus encoding the 
spike gene. Infection of a suitable insect cell line (e.g. Sf9, Sf21, High 
Five/Invitrogen) at an appropriate multiplicity of infection leads to 
expression of the recombinant spike protein. Protein expression is 
5 confirmed by western blotting or ELISA using appropriate reagents (e.g. 
convalescent canine serum or other virus specific antiserum). 

The invention thus includes a method of obtaining a CRCV or CRCV-like 
protein encoded by the CRCV or CRCV-like polynucleotide of the present 
invention. The method comprises culturing the host cell comprising the 
10 CRCV or CRCV-like polynucleotide, typically in a vector; expressing the 
protein in the host cell, and purifying the protein. The invention further 
includes the protein obtainable by this method. 

The invention thus also includes a method of obtaining a glycosylated 
CRCV or CRCV-like protein, typically an S protein, encoded by the CRCV 

15 or CRCV-like polynucleotide of the present invention. The method 
comprises culturing a eukaryotic, or more preferably mammalian, host cell 
comprising the CRCV or CRCV-like polynucleotide, typically in a vector; 
expressing the protein in the host cell; and purifying the glycosylated 
protein. The invention further includes the glycosylated protein obtainable 

20 by this method. 

In a fifth aspect, the invention provides a method of making an anti-CRCV 
antibody comprising raising an immune response to a CRCV or CRCV-like 
S protein of the invention as described above in the first aspect of the 
invention in an animal, and preparing an antibody from the animal or from 
25 an immortal cell derived therefrom. Alternatively, the method may 
comprise selecting an antibody from an antibody-display library using a 
CRCV or CRCV-like S protein of the invention as described above in the 
first aspect of the invention. 
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Methods and techniques for producing a monoclonal antibody are well 
known to a person of skill in the art, for example those disclosed in 
"Monoclonal Antibodies: A manual of techniques' 3 , H Zola (CRC Press, 
1988) and in "Monoclonal Hybridoma Antibodies: Techniques and 
5 Applications", J G R Hurrell (CRC Press, 1982), incorporated herein by 
reference. 

Optionally, the method further comprises determining whether the antibody 
thus obtained has greater affinity for the CRCV S protein than for the BCV 
S protein, and preferably also whether the antibody has a greater affinity for 
10 the CRCV S protein than for the HCV and HEV S proteins. Methods for 
determining the relative affinity of antibodies for antigens are known in the 
art. 

The invention also includes an anti-CRCV antibody obtainable by the 
method of the fifth aspect of the invention, that has greater affinity for the 
15 CRCV S protein than for the BCV S protein. Preferably, the antibody also 
has a greater affinity for the CRCV S protein than for the HCV and HEV S 
proteins. 

The invention also includes a method of making an anti-CRCV antibody 
comprising raising an immune response to a CRCV or CRCV-like HE 

20 protein of the invention as described above in the third aspect of the 
invention in an animal, and preparing an antibody from the animal or from 
an immortal cell derived therefrom. Alternatively, the method may 
comprise selecting an antibody from an antibody-display library using a 
CRCV or CRCV-like HE protein of the invention as described above in the 

25 third aspect of the invention. 

Optionally, the method further comprises determining whether the antibody 
thus obtained has greater affinity for the CRCV HE protein than for the 
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BCV HE protein, and preferably also whether the antibody has a greater 
affinity for the CRCV HE protein than for the HCV and HEV HE proteins. 
Methods for determining the relative affinity of antibodies for antigens are 
known in the art. 

The invention also includes an anti-CRCV antibody obtainable by the 
method of the fifth aspect of the invention, that has greater affinity for the 
CRCV HE protein than for the BCV HE protein. Preferably, the antibody 
also has a greater affinity for the CRCV HE protein than for the HCV and 
HEV HE proteins. 

Preferably, the antibody is a monoclonal antibody. However, the invention 
includes a monospecific anti-CRCV antibody. The antibody may be an 
antibody fragment, as described below. 

The monoclonal or monospecific antibody may be a chimaeric antibody, as 
discussed by Neuberger et al (1988, 8th International Biotechnology 
Symposium Part 2, 792-799). The monoclonal or monospecific antibody may 
also be a "caninised" antibody, for example by inserting the CDR regions of 
mouse antibodies into the framework of canine antibodies. 

The invention also includes anti-CRCV antibody fragments. The variable 
heavy (V H ) and variable light (V L ) domains of antibodies are involved in 
antigen recognition, a fact first recognised by early protease digestion 
experiments. Further confirmation was found by "humanisation" of rodent 
antibodies, in which variable domains of rodent origin may be fused to 
constant domains of human origin such that the resultant antibody retains the 
antigenic specificity of the rodent parented antibody (Morrison et al (1984) 
Proc. Natl. Acad. Sci. USA 81, 6851-6855). 
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That antigenic specificity is conferred by variable domains and is independent 
of the constant domains is known from experiments involving the bacterial 
expression of antibody fragments, all containing one or more variable 
domains. 

These molecules include Fab-like molecules (Better et al (1988) Science 240, 
1041); Fv molecules (Skerra et al (1988) Science 240, 1038); single-chain Fv 
(ScFv) molecules where the V H and V L partner domains are linked via a 
flexible oligopeptide (Bird et al (1988) Science 242, 423; Huston et al (1988) 
Proc. Natl. Acad. Sci. USA 85, 5879) and single domain antibodies (dAbs) 
comprising isolated V domains (Ward et al (1989) Nature 341, 544). A 
general review of the techniques involved in the synthesis of antibody 
fragments which retain their specific binding sites is to be found in Winter & 
Milstein (1991) Nature 349, 293-299. 

By "ScFv molecules" we mean molecules wherein the V H and V L partner 
domains are linked via a flexible oligopeptide. 

The advantages of antibody fragments, rather than whole antibodies, are 
several-fold. The smaller size of the fragments may lead to improved 
pharmacological properties, such as better penetration of solid tissue. Effector 
functions of whole antibodies, such as complement binding, are removed. 
Fab, Fv, ScFv and dAb antibody fragments can all be expressed in and 
secreted from E. coli, thus allowing the facile production of large amounts of 
the fragments. 

Whole antibodies, and F(ab') 2 fragments are "bivalent". By "bivalent" we 
mean that the antibodies and F(ab*) 2 fragments have two antigen combining 
sites. In contrast, Fab, Fv, ScFv and dAb fragments are monovalent, having 
only one antigen combining sites. 
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In a sixth aspect, the invention provides a method of determining whether a 
dog has been exposed to CRCV. The method comprises obtaining a 
suitable sample from the dog, and identifying CRCV or an anti-CRCV 
antibody in the sample. The method may be used as an aid in the diagnosis 
5 of whether a dog has CIRD. 

The invention includes a method of detecting, in a sample obtained from a 
dog, past exposure of the dog to CRCV, the method comprising obtaining a 
suitable sample from the dog, and identifying anti-CRCV antibodies in the 
sample. 

10 In one preferred embodiment, the suitable sample can be any antibody 
containing sample such as serum, saliva, tracheal wash or bronchiolar 
lavage. 

Preferably, the anti-CRCV antibody can be detected using a BCV, HCV, 
HEV or CRCV antigen, more preferably, using a BCV or CRCV antigen. 

15 More preferably, identifying an anti-CRCV antibody in the sample 
comprises identifying an antibody that selectively binds to an S protein 
whose amino acid sequence is at least 75% identical with the amino acid 
sequence of the CRCV S protein (Figure 4 and SEQ ID NO: 4); an S protein 
whose amino acid sequence is at least 75% identical with the amino acid 

20 sequence of the BCV S protein (Genbank Accession No. AF058942); HCV 
S protein (Genbank Accession No. L14643); to a coronavirus having an S 
protein at least 75% identical with BCV S protein (Genbank Accession No. 
AF058942), or a fragment thereof; or to a coronavirus having an S protein 
at least 75% identical with the CRCV S protein, or a fragment thereof. 



25 More preferably, identifying an antibody that selectively binds to an S 
protein whose amino acid sequence is at least 75% identical with the amino 
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acid sequence of the BCV S protein, comprises identifying an antibody that 
selectively binds to an S protein whose amino acid sequence is at least 80% 
identical, or at least 85% identical, or at least 90% identical, or at least 95% 
identical with the amino acid sequence of the BCV S protein (Genbank 
Accession No. AF058942) or a fragment thereof. 

More preferably, identifying an anti-CRCV antibody in the sample 
comprises identifying an antibody that selectively binds to the BCV S 
protein (Genbank Accession No. AF058942). 

Even more preferably, identifying an antibody that selectively binds to an S 
protein whose amino acid sequence is at least 75% identical with the amino 
acid sequence of the CRCV S protein, comprises identifying an antibody 
that selectively binds to an S protein whose amino acid sequence is at least 
80% identical, or at least 85% identical, or at least 90% identical, or at least 
95% identical with the amino acid sequence of the CRCV S protein (Figure 
4 and SEQ ID NO: 4) or a fragment thereof. 

Yet more preferably, identifying an anti-CRCV antibody in the sample 
comprises identifying an antibody that selectively binds to a CRCV or 
CRCV-like S protein as defined in the first aspect of the invention. 

Most preferably, identifying an anti-CRCV antibody in the sample 
comprises identifying an antibody that selectively binds to the CRCV S 
protein as listed in Figure 4 (SEQ ID NO: 4), or a fragment thereof. 

Similarly, identifying an anti-CRCV antibody in the sample may comprise 
identifying an antibody that selectively binds to an HE protein whose amino 
acid sequence is at least 90% identical with the partial amino acid sequence 
of the CRCV HE protein (Figure 14 and SEQ ID NO: 22); to an HE protein 
whose amino acid sequence is at least 90% identical with the amino acid 



WO 2004/011651 



PCT/GB2003/002832 



39 

sequence of the BCV HE protein (Genbank Accession No. AF058942) or 
the HECV HE protein (Genbank Accession No. L07747); to a coronavirus 
having an S protein at least 90% identical with BCV HE protein (Genbank 
Accession No. AF058942), or a fragment thereof; or to a coronavirus 
5 having an HE protein at least 90% identical with the CRCV HE protein, or a 
fragment thereof. 

More preferably, identifying an antibody that selectively binds to an HE 
protein whose amino acid sequence is at least 90% identical with the amino 
acid sequence of the BCV HE protein, comprises identifying an antibody 

10 that selectively binds to an HE protein whose amino acid sequence is at 
least 91% identical, or at least 92% identical, or at least 93% identical, or at 
least 94% identical, or at least 95% identical, or at least 96% identical, or at 
least 97% identical, or at least 98% identical, or at least 99% identical with 
the amino acid sequence of the BCV HE protein (Genbank Accession No. 

15 AF058942) or a fragment thereof. 

More preferably, identifying an anti-CRCV antibody in the sample 
comprises identifying an antibody that selectively binds to the BCV HE 
protein (Genbank Accession No. AF058942). 

Even more preferably, identifying an antibody that selectively binds to an 
20 HE protein whose amino acid sequence is at least 90% identical with the 
partial amino acid sequence of the CRCV HE protein, comprises identifying 
an antibody that selectively binds to an HE protein whose partial amino acid 
sequence is is at least 91% identical, or at least 92% identical, or at least 
93% identical, or at least 94% identical, or at least 95% identical, or at least 
25 96% identical, or at least 97% identical, or at least 98% identical, or at least 
99% identical with the partial amino acid sequence of the CRCV HE protein 
(Figure 1 3) or a fragment thereof. 
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Yet more preferably, identifying an anti-CRCV antibody in the sample 
comprises identifying an antibody that selectively binds to a CRCV or 
CRCV-like HE protein as defined in the third aspect of the invention. 

Most preferably, identifying an anti-CRCV antibody in the sample 
5 comprises identifying an antibody that selectively binds to the CRCV HE 
protein whose partial amino acid sequence is listed in Figure 14 (SEQ ID 
NO: 22), or a fragment thereof 

The invention includes a method of detecting CRCV in a sample obtained 
from a dog, the method comprising obtaining a suitable sample from the 
10 dog, and identifying CRCV in the sample. 

It is appreciated that there may be some naturally occurring sequence 
variation between different isolates of CRCV. The invention thus includes 
identifying CRCV isolates whose S, pol and HE genes and proteins have 
some sequence variation from the sequences provided in Figures 1 to 4 and 
15 13 and 14. It is appreciated, however, that the same methods will be used to 
detect the variant isolates of CRCV, as well as the isolate characterised by 
the sequences listed in Figures 1 to 4 and 13 and 14. 

In a preferred embodiment, the suitable sample can be a lung wash, tracheal 
wash, tonsillar swab or a biopsy or post-mortem sample from the respiratory 
20 tract of the dog. 

Preferably, in this embodiment, identifying CRCV comprises identifying a 
nucleic acid component of CRCV. 

Typically, this will be performed by extracting RNA from the sample, and 
obtaining cDNA therefrom, for example as is described in Example 1. 
25 Thereafter, a CRCV nucleic acid component is identified in the cDNA, for 
example using techniques involving high stringency hybridisation, specific 
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amplification, and nucleotide sequencing, as are well known to a person of 
skill in the art (Sambrook et al (2001) supra). 

Preferably, identifying CRCV comprises identifying a polynucleotide that 
hybridises at high stringency to the BCV genome, such as the LY138 strain 
genome (Genbank Accession No. AF058942) or a portion thereof. 

Further preferably, identifying CRCV comprises identifying a 
polynucleotide that hybridises at high stringency to the CRCV S, pol or HE 
polynucleotides (Figures 1, 3 and 13) or a portion thereof. 

By "hybridising at high stringency" is meant that the polynucleotide and the 
nucleic acid to which it hybridises have sufficient nucleotide sequence 
similarity that they can hybridise under highly stringent conditions. As is 
well known in the art, the stringency of nucleic acid hybridisation depends 
on factors such as length of nucleic acid over which hybridisation occurs, 
degree of identity of the hybridising sequences and on factors such as 
temperature, ionic strength and CG or AT content of the sequence. 

Nucleic acids which can hybridise at high stringency to the CRCV cDNA 
molecule include nucleic acids which have >90% sequence identity, 
preferably those with >95% or >96% or >97% or >98, more preferably 
those with >99% sequence identity, over at least a portion of the CRCV 
cDNA. 

Typical highly stringent hybridisation conditions which lead to selective 
hybridisation are known in the art, for example those described in 
Sambrook et al 2001 {supra), incorporated herein by reference. 

An example of a typical hybridisation solution when a nucleic acid is 
immobilised on a nylon membrane and the probe nucleic acid is> 500 bases 
is: 
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6 x SSC (saline sodium citrate) 

0.5% sodium dodecyl sulphate (SDS) 

100 (xg/ml denatured, fragmented salmon sperm DNA 

The hybridisation is performed at 68°C. The nylon membrane, with the 
nucleic acid immobilised, may be washed at 68°C in 0.1 x SSC. 

20 x SSC may be prepared in the following way. Dissolve 175.3 g of NaCl 
and 88.2 g of sodium citrate in 800 ml of H 2 0. Adjust the pH to 7.0 with a 
few drops of a 10 N solution of NaOH. Adjust the volume to 1 litre with 
H 2 0. Dispense into aliquots. Sterilise by autoclaving. 

An example of a typical hybridisation solution when a nucleic acid is 

0 

immobilised on a nylon membrane and the probe is an oligonucleotide of 
between 1 5 and 50 bases is: 

3.0 M trimethylammonium chloride (TMAC1) 
0.01 M sodium phosphate (pH 6.8) 
1 mm EDTA (pH 7.6) 
0.5% SDS 

100 ug/ml denatured, fragmented salmon sperm DNA 
0.1% non-fat dried milk 

The optimal temperature for hybridisation is usually chosen to be 5°C 
below the Tj for the given chain length. Tj is the irreversible melting 
temperature of the hybrid formed between the probe and its target sequence. 
Jacobs et al (1988) Nucl. Acids Res. 16, 4637 discusses the determination of 
TiS. The recommended hybridization temperature for 17-mers in 3M 
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TMAC1 is 48-50°C; for 19-mers, it is 55-57°C; and for 20-mers, it is 58- 
66°C. 

Preferably, identifying CRCV comprises using a polynucleotide having at 
least 80%, or at least 85%, or at least 90%, or at least 95% identity with a 
portion of the BCV genome (Genbank Accession No. AF058942). 

More preferably, identifying CRCV comprises using a polynucleotide 
having at least 80%, or at least 85%, or at least 90%, or at least 95% identity 
with a portion of the CRCV S polynucleotide (Figure 3), or having at least 
90%, or at least 95% identity with a portion of the CRCV pol 
polynucleotide (Figure 1), or having at least 90%, or at least 95% identity 
with a portion of the CRCV HE polynucleotide (Figure 13). 

More preferably, identifying CRCV comprises identifying a CRCV 
polynucleotide as defined above with respect to the fourth aspect of the 
invention. 

Most preferably, identifying CRCV comprises identifying a CRCV 
polynucleotide comprising or consisting of a sequence listed in Figure 1 or 
Figure 3 or Figure 13, or a fragment thereof. 

In another preferred embodiment, identifying CRCV comprises identifying 
a protein component of CRCV. 

Preferably, identifying a protein component of CRCV comprises identifying 
a CRCV protein as defined above in the first or second or third aspects of 
the invention. 

Most preferably, identifying a protein component of CRCV comprises 
identifying a CRCV protein comprising or consisting of the amino acid 
sequence listed in Figure 2 or Figure 4 or Figure 14, or a fragment thereof. 
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Assaying a protein component of CRCV in a biological sample can occur 
using any art-known method. Preferred for assaying CRCV protein levels 
in a biological sample are antibody-based techniques. 

Preferably, identifying a protein component of CRCV comprises using an 
antibody reactive with CRCV. 

More preferably, the antibody reactive with CRCV is an anti-BCV 
antibody, an anti-HCV antibody, an anti-HEV antibody, or an anti-CRCV 
antibody obtainable or obtained by the methods of the fifth aspect of the 
invention. 

For example, CRCV protein expression can be studied with classical 
immunohistological methods. In these, the specific recognition is provided 
by the primary antibody (polyclonal or monoclonal) but the secondary 
detection system can utilise fluorescent, enzyme, or other conjugated 
secondary antibodies. As a result, an immunohistological staining of tissue 
section for pathological examination is obtained. Tissues can also be 
extracted, e.g., with urea and neutral detergent, for the liberation of CRCV 
protein for Western-blot or dot/slot assay (Jalkanen, M., et al, J. Cell. Biol. 
101:976-985 (1985); Jalkanen, M., et al, J. Cell. Biol. 105:3087-3096 
(1987)). In this technique, which is based on the use of cationic solid 
phases, quantitation of CRCV protein can be accomplished using isolated 
CRCV protein as a standard. This technique can also be applied to body 
fluid samples. 

Other antibody-based methods useful for detecting CRCV protein 
expression include immunoassays, such as the enzyme linked 
immunosorbent assay (ELISA) and the radioimmunoassay (RIA). For 
example, a CRCV reactive monoclonal antibody can be used both as an 
immunoadsorbent and as an enzyme-labeled probe to detect and quantify 
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theCRCV protein. The amount of CRCV protein present in the sample can 
be calculated by reference to the amount present in a standard preparation 
using a linear regression computer algorithm. Such an ELISA for detecting 
a tumour antigen is described in Iacobelli et al, Breast Cancer Research and 
Treatment 11: 19-30 (1988). In another ELISA assay, two distinct specific 
monoclonal antibodies can be used to detect CRCV protein in a body fluid. 
In this assay, one of the antibodies is used as the immunoadsorbent and the 
other as the enzyme-labeled probe. 

The above techniques may be conducted essentially as a "one-step" or 
"two-step" assay. The "one-step" assay involves contacting CRCV protein 
with immobilized antibody and, without washing, contacting the mixture 
with the labeled antibody. The "two-step" assay involves washing before 
contacting the mixture with the labeled antibody. Other conventional 
methods may also be employed as suitable. It is usually desirable to 
immobilize one component of the assay system on a support, thereby 
allowing other components of the system to be brought into contact with the 
component and readily removed from the sample. 

Suitable enzyme labels include, for example, those from the oxidase group, 
which catalyze the production of hydrogen peroxide by reacting with 
substrate. Glucose oxidase is particularly preferred as it has good stability 
and its substrate (glucose) is readily available. Activity of an oxidase label 
may be assayed by measuring the concentration of hydrogen peroxide 
formed by the enzyme-labeled antibody/substrate reaction. Besides 

1 25 

enzymes, other suitable labels include radioisotopes, such as iodine ( I, 
121 I), carbon ( 14 C), sulfur 35 S), tritium ( 3 H), indium ( 112 In), and technetium 
( 99 mTc), and fluorescent labels, such as fluorescein and rhodamine, and 
biotin. 
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In a seventh aspect, the invention provides an immunosorbent assay for 
detecting anti-CRCV S or HE antibodies. The assay comprises a solid 
phase coated with a CRCV or CRCV-like S or HE protein, or coated with 
both CRCV or CRCV-like S and HE proteins as defined in the first and 
5 third aspects of the invention, or obtainable using the methods of the fourth 
aspect of the invention, or an antigenic fragment, thereof, wherein anti- 
CRCV S or HE antibodies in a sample exposed to the solid phase will bind 
to the protein; and a detectable label conjugate which will bind to the anti- 
CRCV antibodies bound to the solid phase. 

10 It is appreciated that an antigenic fragment of the CRCV or CRCV-like S 
protein that coats the solid phase is of sufficient size to be bound by an anti- 
CRCV S antibody, and which comprises at least one of the amino acids 
specific for CRCV S protein as listed in Table 1. 

It is also appreciated that an antigenic fragment of the CRCV or CRCV-like 
15 HE protein that coats the solid phase is of sufficient size to be bound by an 
anti-CRCV HE antibody, and which comprises at least one of the three 
amino acids specific for CRCV HE protein as defined above. 

Preferably, the CRCV or CRCV-like S or HE protein, or antigenic fragment 
thereof, that coats the solid phase is at least 10 amino acids in length. More 

20 preferably, the CRCV or CRCV-like S protein, or antigenic fragment 
thereof, is at least 20, or at least 30, or at least 40, or at least 50, or at least 
100, or at least 200, or at least 300, or at least 400 amino acids in length. 
The CRCV or CRCV-like S protein may be at least 500, or at least 600, or 
at least 700, or at least 800, or at least 900, or at least 1,000 amino acids in 

25 length. 

Preferably, the CRCV or CRCV-like S protein, or antigenic fragment 
thereof, that coats the solid phase is less than about 1200 amino acids in 
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length. More preferably, the CRCV or CRCV-like S protein, or antigenic 
fragment thereof, is less than about 1,100, or less than about 1,000, or less 
than about 900, or less than about 800, or less than about 700, or less than 
about 600, or less than about 500 amino acids in length. The CRCV or 
5 CRCV-like S or HE protein may be less than about 400, or less than about 
300, or less than about 200, or less than about 100, or less than about 5.0 
amino acids in length. 

Preferably, the solid phase is a microtitre well. 

Further preferably, the conjugate comprises anti-dog antibody. 

10 Preferably, the conjugate comprises an enzyme, for example horseradish 
peroxidase. Further preferably, the immunosorbent assay also comprises a 
substrate for the enzyme. 

Further details of suitable immunosorbent assays and ELISAs are provided 
above. 

15 The invention includes a kit of parts which include the components of the 
immunosorbent assay. The kit of parts may thus include a solid phase such 
as a microtitre plate, CRCV or CRCV-like S or HE protein or both for 
coating the solid phase, a detectable label conjugate, such as an anti-dog 
antibody, which will bind to anti-CRCV antibodies bound to the solid 

20 phase. If the detectable label conjugate is an enzyme, the kit of parts may 
also include a substrate for the enzyme. The kit may also include a positive 
control sample that contains an anti-CRCV S or HE protein antibody, such 
as those described with reference to the fifth aspect of the invention, and a 
negative control sample. 

25 The invention thus includes a solid substrate with a CRCV or CRCV-like S 
or HE protein as defined in the first and third aspects of the invention, or 
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obtainable using the methods of the fourth aspect of the invention, or an 
antigenic fragment thereof, attached thereto, for capturing anti-CRCV S or 
HE antibodies or both from a liquid sample, wherein anti-CRCV S or HE 
antibodies in a sample exposed to the solid substrate will bind to the S or 
5 HE protein. 

Typically, protein is coated on microtitre plates overnight at 4°C to 37°C, 
depending on the stability of the antigen. Unbound protein is washed off 
with a wash buffer such as phosphate buffered saline or Tris buffered saline. 
Serum or other samples are incubated on the plate, typically at 37°C for 

10 between 1 and several hours. Unbound material is washed off, the plates 
are incubated with enzyme-labelled (e.g. horseradish peroxidase) antibody, 
such as anti-canine IgG or IgM for serum samples, or anti-canine IgA for 
lung washes, for 1 to several hours at 37°C. Unbound antibody is washed 
off and plates are incubated with a substrate such as OPD for about 10 min, 

15 and the optical density measured in a photometer. 

Preferably, the solid substrate is a microtitre well. 

In an eighth aspect, the invention provides a vaccine composition for 
vaccinating dogs comprising (i) a coronavirus having an S protein with at 
least 75% amino acid identity with CRCV S protein, or (ii) a coronavirus 

20 having an S protein with at least 75% amino acid identity with BCV S 
protein, or (iii) a coronavirus having an HE protein with at least 90% amino 
acid identity with CRCV HE protein, or (iv) a coronavirus having an HE 
protein with at least 90% amino acid identity with BCV HE protein, or (v) a 
coronavirus protein having at least 75% amino acid identity with a CRCV 

25 protein or an immunogenic fragment thereof, or (vi) a coronavirus protein 
having at least 75% amino acid identity with a BCV protein or an 
immunogenic fragment thereof, or (vii) a nucleic acid encoding said 
coronaviral protein or immunogenic fraction thereof. 
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Preferably, the vaccine is packaged and presented for use in dogs. 

When the vaccine comprises a coronavirus protein, or an immunogenic 
fragment thereof, the protein preferably has at least 80%, or at least 85%, or 
at least 90%, or at least 95% amino acid identity with the corresponding 
5 portion of a BCV or CRCV protein. 

Preferably, the coronavirus protein is a BCV, HCV, HEV or CRCV protein, 
or a modification thereof. 

Typical protein modifications include amino acid substitutions to improve 
the antigenticity of the vaccine. BCV, HCV and HEV proteins may be 

10 modified to be more like a CRCV protein. For example, the spike protein 
of BCV, HCV or HEV may be modified to include a CRCV amino acid at 
any of differences shown in the comparison in Figure 10, or listed in Table 
1. Additionally or alternatively, the HE protein of BCV, HCV or HEV may 
be modified to include a CRCV amino acid at any of the three CRCV- 

15 specific residues as defined above. 

Proteins in which one or more of the amino acid residues are chemically 
modified, may be used providing that the function of the protein, namely the 
production of specific antibodies in vivo, remains substantially unchanged. It 
is appreciated that synthesised proteins may be suitably modified before or 
20 after their synthesised. Such modifications include forming salts with acids or 
bases, especially physiologically acceptable organic or inorganic acids and 
bases, forming an ester or amide of a terminal carboxyl group, and attaching 
amino acid protecting groups such as N-t-butoxycarbonyl. Such 
modifications may protect the peptide from in vivo metabolism. 

25 The protein may be present as single copies or as multiples, for example 
tandem repeats. Such tandem or multiple repeats may be sufficiently 
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antigenic themselves to obviate the use of a carrier. It may be advantageous 
for the protein to be formed as a loop, with the N-terminal and C-terminal 
ends joined together, or to add one or more Cys residues to an end to increase 
antigenicity and/or to allow disulphide bonds to be formed. If the protein is 
5 covalently linked to a carrier, preferably a polypeptide, then the arrangement 
is preferably such that the protein of the invention forms a loop. 

According to current immunological theories, a carrier function should be 
present in any immunogenic formulation in order to stimulate, or enhance 
stimulation of, the immune system. It is thought that the best carriers embody 

10 (or, together with the antigen, create) a T-cell epitope. The peptides may be 
associated, for example by cross-linking, with a separate carrier, such as 
serum albumins, myoglobins, bacterial toxoids and keyhole limpet 
haemocyanin. More recently developed carriers which induce T-cell help in 
the immune response include the hepatitis-B core antigen (also called the 

15 nucleocapsid protein), presumed T-cell epitopes such as Thr-Ala-Ser-Gly-Val- 
Ala-Glu-Thr-Thr-Asn-Cys (SEQ ID NO: 52), beta-galactosidase and the 163- 
171 peptide of interleukin-1. The latter compound may variously be regarded 
as a carrier or as an adjuvant or as both. Alternatively, several copies of the 
same or different proteins of the invention may be cross-linked to one another; 

20 in this situation there is no separate carrier as such, but a carrier function may 
be provided by such cross-linking. Suitable cross-linking agents include those 
listed as such in the Sigma and Pierce catalogues, for example glutaraldehyde, 
carbodiimide and succinimidyl 4-(N-maleimidomethyl)cyclohexane- 1 - 
carboxylate, the latter agent exploiting the -SH group on the C-terminal 

25 cysteine residue (if present). 

If the protein is prepared by expression of a suitable nucleotide sequence in a 
suitable host, then it may be advantageous to express it as a fusion product 
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with a peptide sequence which acts as a carrier. Kabigen's "Ecosec" system is 
an example of such an arrangement. 

It is appreciated that the coronavirus component of the vaccine may be linked 
to other antigens to provide a dual effect. 

5 Preferably, the coronavirus protein in the vaccine composition is an S 
protein. More preferably, the S protein is a CRCV or CRCV-like S protein 
as defined above in the first aspect of the invention or obtainable by the 
methods of the fourth aspect of the invention, a BCV S protein, an HCV S 
protein, an HEV S protein, or an immunogenic fragment thereof. 

10 Most preferably, the vaccine composition contains a CRCV S protein that 
comprises or consists of the amino acid sequence listed in Figure 4, or an 
immunogenic fragment thereof having at least 97% identity with the 
sequence listed in Figure 4. Preferably, the variant has at least at least 98%, 
or at least 99% amino acid sequence identity with the sequence listed in 

15 Figure 4. More preferably the variant has at least 99.1%, or at least 99.2%, 
or at least 99.3%, or at least 99.4%, or at least 99.5%, or at least 99.6%, or 
at least 99.7%, or at least 99.8%, or at least 99.9% amino acid sequence 
identity with the sequence listed in Figure 4. 

Additionally or alternatively, the vaccine composition may comprise 
20 coronavirus proteins such as a hemagglutinin-esterase protein (HE) or an 
integral membrane protein (M), or the small membrane protein (E) (Lai 
MMC & Cavanagh D, (1997) "The molecular biology of coronaviruses" 
Adv.Vir.Res, 48: 1-100). 

In one embodiment, the HE, E or M proteins are BCV, HCV or HEV 
25 proteins. In another embodiment, the HE, E or M proteins are CRCV 
proteins. 
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Preferably, the HE protein is a CRCV or CRCV-like HE protein as defined 
above in the third aspect of the invention or obtainable by the methods of 
the fourth aspect of the invention, or an immunogenic fragment thereof. 

More preferably, the vaccine composition contains a CRCV HE protein that 
5 comprises or consists of the partial amino acid sequence listed in Figure 14, 
or an immunogenic fragment thereof having at least 97% identity with the 
sequence listed in Figure 14. Preferably, the variant has at least at least 
98%, or at least 99% amino acid sequence identity with the sequence listed 
in Figure 14. More preferably the variant has at least 99.1%, or at least 
10 99.2%, or at least 99.3%, or at least 99.4%, or at least 99.5%, or at least 
99.6%, or at least 99.7%, or at least 99.8%), or at least 99.9% amino acid 
sequence identity with the sequence listed in Figure 14. 

When die vaccine comprises a coronavirus, preferably the coronavirus 
comprises an S protein with at least 80%, or at least 85%, or at least 90%, or 
15 at least 95% amino acid identity with the BCV S protein. More preferably, 
the coronavirus comprises an S protein with at least 80%, or at least 85%, or 
at least 90%, or at least 95% amino acid identity with the CRCV S protein. 

Additionally or alternatively, when the vaccine comprises a coronavirus, 
preferably the coronavirus comprises an HE protein with at least 90% or at 
20 least 95% amino acid identity with the BCV HE protein. More preferably, 
the coronavirus comprises an HE protein with at least 96%, or at least 97%, 
or at least 98%, or at least 99% amino acid identity with the CRCV HE 
protein. 

In another preferred embodiment, the vaccine composition comprises a 
25 virus selected from BCV, HCV, HEV and CRCV, or a modification thereof. 
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It is appreciated that dog vaccines effective against a canine virus may be 
derived from a non-canine virus. For example US Patent No. 5,750,112 to 
Gill, and assigned to Solvay Animal Health Inc, discloses a vaccine against 
enteric canine coronavirus containing inactivated feline enteric coronavirus. 
5 The disclosure of US 5,750, 1 1 2 is incorporated herein by reference. 

In one preferred embodiment, the virus is an inactivated virus. Methods for 
inactivating viruses for use in vaccines are well known in the art. Suitable 
methods include chemical methods, such as the use of beta proprio-lactone 
(BPL). Suitable inactivated bovine coronavirus vaccines may include 

10 inactivated BCV which is a component of bovine vaccines such as 
"Rotovec Corona" from Schering-Plough 

(http://www.ukvet.co.uk/rotovec/scour.htm); "Lactovac" by Hoechst 
Roussel Vet Ltd, (Veterinary Formulary 5th Edition of the Veterinary Data 
Sheet Compendium); "First Defense" by Immuncell Corp, USA; "Scour 

15 Bos 4" by Grand Laboraotries and "Scour Guard 3K" by Pfizer. 

In an alternative embodiment, the virus is an attenuated virus. Methods for 
attenuating viruses for use in vaccines are well known in the art. 

Preferably, the vaccine composition also comprises a pharmaceutically 
acceptable adjuvant. 

20 Preferably, when the vaccine comprises a nucleic acid, the nucleic acid 
encoding the coronaviral protein or immunogenic fraction thereof, for use 
as a vaccine is a CRCV or CRCV-like S polynucleotide, or a CRCV or 
CRCV-like HE polynucleotide or both a CRCV or CRCV-like S and HE 
polynucleotide. More preferably, the nucleic acid comprises or consists of 

25 the nucleotide sequence listed in Figure 3 or Figure 13, or fractions thereof. 
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For vaccine use, the CRCV or CRCV-like S or HE nucleic acid can be 
delivered in various replicating (e.g. recombinant adenovirus vaccine) or 
non-replicating (DNA vaccine) vectors. 

In a preferred embodiment, the vaccine may contain recombinant CRCV or 
CRCV-like S protein, as well as other immunogenic coronavirus proteins 
such as the HE protein. 

As discussed above, several viral and bacterial agents are known to be 
associated with respiratory disease in dogs, including canine parainfluenza 
virus (CPIV), canine adenovirus type 2 (CAV-2), canine herpesvirus 
(CHV), and Bordetella bronchiseptica (B. bronchiseptica). 

In another preferred embodiment, the vaccine may contain recombinant 
CRCV or CRCV-like S or HE protein, as well as other pathogenic 
organisms involved in respiratory disease of dogs such as canine 
parainfluenzavirus, canine adenovirus type 2, the bacterium Bordetella 
bronchiseptica, canine herpesvirus, human reovirus and mycoplasma 
species, or immunogenic proteins therefrom. Thus the vaccine may contain 
an agent capable of raising an immune response, such as the production of 
antibodies against CRCV, as well as against other pathogenic organisms 
involved in respiratory disease of dogs such as CPIV, CAV-2, B. 
bronchiseptica and CHV. 

In an embodiment, as well as containing an agent capable of stimulating the 
production of antibodies against CRCV, such as a CRCV or CRCV-like S 
or HE protein, the vaccine composition further comprises any one or more 
of: 

(a) an agent capable of raising an immune response in a dog 
against CPIV; 
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(b) an a; 
against CAV-2; 



gent capable of raising an immune response in a dog 



(c) an a: 
against CHV; and 



gent capable of raising an immune response in a dog 



(d) an a; 



gent capable of raising an immune response in a dog 



against B. bronchiseptica. 

Thus the vaccine composition can optionally also comprise any two, or any 
three or all four of these additional agents (a), (b), (c) and (d). 

Typically, an agent capable of raising an immune response in a dog against 
CPIV comprises inactivated or attenuated CPIV, or an immunogenic 
fragment thereof, or a nucleic acid encoding said immunogenic fraction. 

Typically, an agent capable of raising an immune response in a dog against 
CAV-2 comprises inactivated or attenuated CAV-2, or an immunogenic 
fragment thereof, or a nucleic acid encoding said immunogenic fraction. 

Canine adenovirus type 1 causes infectious hepatitis; canine adenovirus 
type 2 causes respiratory disease. It has been shown that CAV-1 provides 
cross-protection against CAV-2 and vice versa. The agent that raises an 
immune response in a dog against CAV-2 may therefore contain either 
CAV-1 or CAV-2, or an immunogenic fragment thereof. The vaccines 
listed below contain CAV-2 except for EURICAN DHPPi, which does not 
specify the virus type used. 

Suitable agents that raise an immune response in a dog against CPIV and 
CAV-2 are known to a person of skill in the art. For example, the following 
dog vaccines are licensed in the UK. 
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KAVAK DA 2 PiP69 by Fort Dodge Animal Health is a live freeze dried 
vaccine containing attenuated strains of canine distemper virus, canine 
adenovirus type 2, canine parainfluenza type 2 and canine parvovirus grown 
in tissue culture. 

5 KAVAK Parainfluenza by Fort Dodge Animal Health contains live freeze- 
dried vaccine derived from an attenuated strain of canine parainfluenza 
virus type 2 cultivated on an established homologous cell-line. 

NOBIVAC DHPPi by Intervet UK Limited is a live attenuated freeze-dried, 
virus vaccine containing canine distemper virus, canine adenovirus type 2 5 
10 canine parvovirus and canine parainfluenza virus grown in cell line tissue 
culture. 

NOBIVAC KC by Intervet UK Limited is a modified live freeze-dried 
vaccine containing Bordetella bronchiseptica strain B-C2 and canine 
parainfluenza virus strain Cornell (this is an intranasal vaccine). 
15 Management authorisation number Vm 06376/4026. 

EURICAN DHPPi by Merial Animal Health Ltd. is a combined live freeze- 
dried vaccine against canine distemper, infectious canine hepatitis, canine 
parvovirus and canine parainfluenza virus type 2. 

VANGUARD 7 by Pfizer Ltd. contains live attenuated canine distemper 
20 virus (Snyder Hill strain), adenovirus (CAV-2 Manhattan strain), 
parainfluenza virus (NL-CPI-5 strain), canine parvovirus (NL-35-D) 
propagated in an established cell line, and an inactivated culture of 
Leptospira canicola and Leptospira icterohaemorrhagiae, 

QUANTUM DOG 7 by Schering-Plough Animal Health contains canine 
25 distemper, adenovirus type 2, parvovirus, parainfluenza virus type 2 vaccine 
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(living) and inactivated Leptospira canicola and Leptospira 
icterohaemorrhagiae vaccine. 

CANIGEN DHPPi by Virbac Ltd. is a live attenuated, freeze-dried, virus 
vaccine containing canine distemper virus, canine adenovirus (CAV2), 
5 canine parvovirus and canine parainfluenza virus grown in cell line tissue 
culture. 

CANIGEN Ppi by Virbac Ltd. is a live attenuated, freeze-dried virus 
vaccine containing canine parvovirus and canine parainfluenza virus grown 
in cell line tissue culture. 

10 Typically, an agent capable of raising an immune response in a dog against 
CHV comprises inactivated or attenuated CHV, or an immunogenic 
fragment thereof, or a nucleic acid encoding said immunogenic fraction. 

Suitable agents that raise an immune response in a dog against CHV are 
known to a person of skill in the art. For example, EURICAN Herpes 205 
15 by Merial is a purified sub-unit vaccine against canine herpesvirus which is 
indicated for the active immunisation of pregnant bitches to prevent 
mortality, clinical signs and lesions in puppies resulting from canine 
herpesvirus infections acquired in the first days of life. It is not licensed for 
the vaccination of adult dogs for the prevention of respiratory disease. 

20 Typically, an agent capable of raising an immune response in a dog against 
B. bronchiseptica comprises inactivated or attenuated B. bronchiseptica, or 
an immunogenic fragment thereof, or a nucleic acid encoding said 
immunogenic fraction. 

Suitable agents that raise an immune response in a dog against B. 
25 bronchiseptica are known to a person of skill in the art. For example, the 
following dog vaccines are licensed for use. 
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COUGHGUARD-B® by Pfizer Animal Health (U.S. Vet. Lie. No.: 189) 
contains an inactivated culture of B. bronchiseptica. It is for the 
immunisation of healthy dogs against disease caused by B. bronchiseptica, 
in particular kennel cough. COUGHGUARD-B® is prepared from a highly 
5 antigenic strain of B. bronchiseptica which has been inactivated and 
processed to be nontoxic when administered to dogs. The production 
method is reported to leave the immunogenic properties of B. 
bronchiseptica intact. 

VANGUARD® 5/B by Pfizer Animal Health (U.S. Vet. Lie. No.: 189) 
10 contains attenuated strains of canine distemper virus (CDV), CAV-2, CPIV, 
and canine parvovirus (CPV) propagated on an established canine cell line. 
The CPV antigen was attenuated by low passage on the canine cell line and 
at that passage level has immunogenic properties capable of overriding 
maternal antibodies. The vaccine is packaged in lyophilised form with inert 
15 gas in place of vacuum. The bacterin component containing inactivated 
whole cultures of B, bronchiseptica which is supplied as diluent. The B. 
bronchiseptica component in VANGUARD® 5/B is prepared from a highly 
antigenic strain which has been inactivated and processed to be nontoxic 
when administered to dogs. 

20 NASAGUARD-B™ by Pfizer Animal Health (U.S. Vet. Lie. No.: 112) is 
composed of an avirulent live culture of B. bronchiseptica bacteria. 

PROGARD®-KC by Intervet is a modified live intranasal vaccine 
containing attenuated canine parainfluenza virus and Bordetella 
bronchiseptica avirulent live culture. PROGARD®-KC is presented in a 
25 desiccated form with sterile diluent provided for reconstitution. 
PROGARD®-KC is for vaccination of healthy, susceptible puppies and dogs 
for prevention of canine infectious tracheobronchitis ("kennel cough") due 
to canine parainfluenza virus and B. bronchiseptica. 
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PROG ARD®-KC PLUS by Intervet contains live culture of avirulent strains 
of B. bronchiseptica, attenuated canine adenovirus type 2 and parainfluenza 
virus for intranasal administration. Vaccination with PROGARD®-KC Plus 
stimulates rapid, local immunity in the respiratory tract, thereby inhibiting 

5 infection at the port of entry as well as preventing clinical signs. In addition 
to local immunity, it also stimulates systemic immunity within three weeks 
of intranasal administration. The small volume (0.4 ml) and one nostril 
application of PROGARD®-KC Plus provide for ease in vaccination, 
particularly in small breeds and young puppies. PROGARD®-KC Plus is 

io presented in a desiccated form with sterile diluent provided for 
reconstitution. PROGARD®-KC Plus is for vaccination of healthy dogs and 
puppies three weeks of age or older for prevention of canine infectious 
tracheobronchitis ("kennel cough") due to canine adenovirus type 2, 
parainfluenza virus and B. bronchiseptica. 

15 Intrac by Intervet is a freeze dried modified live vaccine, containing B. 
bronchiseptica strain S 55, for intranasal administration. Product licence 
number PL 0201/4011 

Nobivac KC, described above, also contains B. bronchiseptica. 

Vaccination would be useful especially but not exclusively for dogs prior to 
20 entry into a boarding kennel or for the vaccination of dogs in breeding 
facilities. 

A typical dose of a vaccine comprised of recombinant protein is about 5-10 
(ig. A typical dose of a vaccine comprised of inactivated virus is about 1-10 
mg. 

25 In a ninth aspect, the invention provides the use of (i) a coronavirus having 
an S protein with at least 75% amino acid identity with CRCV S protein, or 
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(ii) a coronavirus having an S protein with at least 75% amino acid identity 
with BCV S protein, or (iii) a coronavirus having an HE protein with at 
least 90% amino acid identity with CRCV HE protein, or (iv) a coronavirus 
having an HE protein with at least 90% amino acid identity with BCV HE 
5 protein, or (v) a coronavirus protein having at least 75% amino acid identity 
with a CRCV protein or an immunogenic fragment thereof, or (vi) a 
coronaviral protein having at least 75% amino acid identity with a BCV 
protein, or an immunogenic fragment thereof, or (vii) a nucleic acid 
encoding said coronaviral protein or immunogenic fraction thereof, in the 
10 preparation of a medicament for stimulating an immune response against 
CRCV in a dog. 

The invention includes the use of (i) a coronavirus having an S protein with 
at least 75% amino acid identity with CRCV S protein, or (ii) a coronavirus 
having an S protein with at least 75% amino acid identity with BCV S 

15 protein, or (iii) a coronavirus having an HE protein with at least 90% amino 
acid identity with CRCV HE protein, or (iv) a coronavirus having an HE 
protein with at least 90% amino acid identity with BCV HE protein, or (v) a 
coronavirus protein having at least 75% amino acid identity with a CRCV 
protein or an immunogenic fragment thereof, or (vi) a coronaviral protein 

20 having at least 75% amino acid identity with a BCV protein, or an 
immunogenic fragment thereof, or (vii) a nucleic acid encoding said 
coronaviral protein or immunogenic fraction thereof, in the preparation of a 
medicament for prophylaxis of respiratory disease in a dog, typically CIRD. 

When a coronavirus protein, or an immunogenic fragment thereof, is used 
25 in the preparation of the medicament, the protein preferably has at least 
80%, or at least 85%, or at least 90%, or at least 95% amino acid identity 
with the corresponding portion of a BCV protein. Preferably the protein has 
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at least 80%, or at least 85%, or at least 90%, or at least 95% amino acid 
identity with the corresponding portion of a CRCV protein. 

Preferably, the coronaviral protein used in the preparation of the 
medicament is a BCV, HCV, HEV or CRCV protein, or a modification 
5 thereof, as described above with reference to the eighth aspect of the 
invention. 

More preferably, the coronaviral protein used in the preparation of the 
medicament is an S protein. Yet more preferably, the S protein comprises 
an CRCV or CRCV-like S protein as defined above in the first aspect of the 
10 invention or obtainable by the methods of the fourth aspect of the invention, 
a BCV S protein, an HCV S protein, or an immunogenic fragment thereof. 

Most preferably, the coronaviral protein used in the preparation of the 
medicament comprises or consists of the amino acid sequence listed in 
Figure 4, or an immunogenic fragment thereof having at least 97% identity 

15 with the sequence listed in Figure 4. Preferably, the variant has at least 
98%, or at least 99% amino acid sequence identity with the sequence listed 
in Figure 4. More preferably the variant has at least 99.1%, or at least 
99.2%, or at least 99.3%, or at least 99.4%, or at least 99.5%, or at least 
99.6%, or at least 99.7%, or at least 99.8%, or at least 99.9% amino acid 

20 sequence identity with the sequence listed in Figure 4. 

Additionally or alternatively, the coronaviral protein used in the preparation 
of the medicament may comprise HE, E, M or N coronavirus proteins. In 
one embodiment, the HE, E, M or N proteins are BCV, HCV or HEV 
proteins. In another embodiment, the HE, E, M or N proteins are CRCV 
25 proteins. 
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Typically, the HE protein comprises an CRCV or CRCV-like HE protein as 
defined above in the third aspect of the invention or obtainable by the 
methods of the fourth aspect of the invention, a BCV HE protein, an HCV 
HE protein, or an immunogenic fragment thereof. 

Preferably, the coronaviral HE protein used in the preparation of the 
medicament comprises or consists of the partial amino acid sequence listed 
in Figure 14, or an immunogenic fragment thereof having at least 97% 
identity with the sequence listed in Figure 14. Preferably, the variant has at 
least 98%, or at least 99% amino acid sequence identity with the sequence 
listed in Figure 14. More preferably the variant has at least 99.1%, or at 
least 99.2%, or at least 99.3%, or at least 99.4%, or at least 99.5%, or at 
least 99.6%, or at least 99.7%, or at least 99.8%, or at least 99.9% amino 
acid sequence identity with the partial sequence listed in Figure 14. 

When a coronavirus is used in the preparation of the medicament, the 
coronavirus preferably comprises an S protein with at least 80%, or at least 
85%o, or at least 90%, or at least 95% amino acid identity with the BCV S 
protein. More preferably the coronavirus comprises an S protein with at 
least 80%, or at least 85%, or at least 90%, or at least 95% amino acid 
identity with the CRCV S protein. 

Additionally or alternatively, the coronavirus may comprise an HE protein 
with at least 90%, or at least 95% amino acid identity with the BCV HE 
protein. More preferably the coronavirus comprises an HE protein with at 
least 96%, or at least 97%, or at least 98%, or at least 99% amino acid 
identity with the CRCV HE protein. 

In a tenth aspect, the invention provides a CRCV or CRCV-like S protein as 
defined above in the first aspect of the invention or obtainable by the 
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methods of the fourth aspect of the invention, for use in medicine. 
Typically, the S protein will be used in veterinary medicine. 

The invention includes a CRCV or CRCV-like HE protein as defined above 
in the third aspect of the invention or obtainable by the methods of the 
fourth aspect of the invention, for use in medicine. Typically, the HE 
protein will be used in veterinary medicine. 

In an eleventh aspect, the invention provides a method of vaccinating a dog 
against CRCV, the method comprising administering to the dog a vaccine 
composition as described above in the ninth aspect of the invention. 

Typically, the vaccine will be administered via the intramuscular, 
subcutaneous or intranasal routes 

In another embodiment, a dog can passively acquire immunity against 
CRCV by being administered an antibody that reacts with CRCV. The 
antibody that reacts with CRCV may be an anti-BCV, anti-HCV antibody, 
but is preferably an anti-CRCV antibody. Preferably, the antibody, that 
reacts with CRCV is an anti-S protein antibody an anti-HE protein antibody. 
Most preferably, the antibody that reacts with CRCV is an anti-CRCV S or 
HE protein antibody as described in the fifth aspect of the invention. 

In a twelfth aspect, the invention provides a method for combating the 
spread of CRCV between dogs comprising determining whether a dog is 
infected with CRCV according to the methods as described above in the 
sixth aspect of the invention, or using the immunosorbent assay or solid 
substrate as described above in the seventh aspect of the invention, and, if 
the dog is infected with CRCV, quarantining the dog. 

By "quarantining" a dog we include the meaning of keeping the dog 
separate from all other dogs. We also include the meaning of keeping the 



WO 2004/011651 



PCT/GB2003/002832 



64 

dog separate from dogs that have not been vaccinated against CRCV, which 
can be performed as described above. We also include the meaning of 
keeping the dog separate from dogs that have not been infected by CRCV, 
which can be determined as described above. 

In a thirteenth aspect, the invention provides a method for combating the 
spread of CRCV between dogs comprising determining whether a dog is 
infected with CRCV according to the methods described above in the sixth 
aspect of the invention, or using the immunosorbent assay or solid substrate 
as described above in the seventh aspect of the invention, and, if the dog is 
infected with CRCV, vaccinating other dogs that have been, are, or are 
likely to be in contact with the dog. 

A fourteenth aspect of the invention provides a method for identifying a test 
vaccine capable of preventing or reducing the incidence of canine infectious 
respiratory disease (CIRD) in dogs. The method comprises (a) determining 
whether a dog has been exposed to CRCV, typically according to the 
methods described above in the sixth aspect of the invention or using the 
immunosorbent assay or solid substrate as described above in the seventh 
aspect of the invention, (b) if the dog has not been exposed to CRCV, 
administering the test vaccine to the dog, (c) inoculating the dog with 
CRCV, and (d) determining whether the dog develops CIRD. The absence 
of CIRD in step (d) indicates that the test vaccine is capable of preventing 
CIRD. 

Typically, this method is perfomied on a set of dogs. 

Preferably, the method involves the use of a set of control dog which are not 
administered the test vaccine in step (b). The significantly lower incidence 
of CIRD in the set of dogs that has been administered the test vaccine than 
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in the control set indicates that the test vaccine is capable of preventing or 
reducing the incidence of CIRD. 

The invention also includes a vaccine identified by this method. 

All of the documents referred to herein are incorporated herein, in their 
entirety, by reference. 

The invention will now be described in more detail with the aid of the 
following Figures and Examples. 

Figure 1 

Partial nucleotide sequence (250 residues) of the CRCV polymerase (pol) 
cDNA (SEQ ID NO: 1). 

Figure 2 

Partial amino acid sequence (83 residues) of the CRCV pol protein (SEQ ID 
NO: 2), derived from the nucleotide sequence of Figure L 

Figure 3 

Nucleotide sequence (4092 residues) of the CRCV Spike (S) cDNA (SEQ 
ID NO: 3). The Y at position 3531 refers to either C or T. 

Figure 4 

Amino acid sequence (1363 residues) of the CRCV S protein (SEQ ID NO: 
4), derived from the nucleotide sequence of Figure 3. 
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Figure 5 

Consensus tree for cDNA sequences from a 250 nucleotide region of the 
polymerase gene of 12 coronaviruses. The sequence obtained from the 
canine respiratory coronavirus is designated T101. The numbers indicate 
5 bootstrap values obtained by analysis of 1 00 data sets. 

BCV: bovine coronavirus, CCV: canine coronavirus, FIPV: feline infectious 
peritonitis virus, HEV: hemagglutinating encephalomyelitis virus, IBV: 
infectious bronchitis virus, MHV: mouse hepatitis virus, OC43: human 
coronavirus strain OC43, SDAV: sialodacryoadenitis virus, TCV: turkey 
io coronavirus, TGEV: transmissible gastroenteritis virus, 229E: human 
coronavirus strain 229E, T101: canine respiratory coronavirus (PCR 
product from tracheal sample T101) 

Figure 6 

CLUSTAL X (1.8) multiple sequence alignment of the 250 nucleotide 
15 partial sequence of the pol cDNA of CRCV (sample T101, SEQ ID NO: 1), 
BCV (SEQ ID NO: 5), HCV strain OC43 (SEQ ID NO: 6), HEV (SEQ ID 
NO: 7) and CCV (enteric CCV, SEQ ID NO: 8). 

Figure 7 

CLUSTAL X (1.8) multiple sequence alignment of the 83 amino acid 
20 partial sequence of the pol protein of CRCV (protCRCVpol, SEQ ID NO: 
2) with HCV (protHCVpoly, SEQ ID NO: 9), HEV (protHEVpoly, SEQ ID 
NO: 10), BCV (protBCVpoly, SEQ ID NO: 11) and CECV (enteric CCV, 
protCECVpol, SEQ ID NO: 12). 
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Figure 8 

CLUSTAL X (1.8) sequence alignment of the nucleotide sequence of the 
CRCV spike cDNA (CRCVspike, SEQ ID NO: 3) and enteric CCV spike 
cDNA (CECVspike, SEQ ID NO: 13). 

5 Figure 9 

CLUSTAL X (1.8) multiple sequence alignment of the 4092 nucleotides of 
the CRCV spike cDNA (CRCVspike, SEQ ID NO: 3) sequence with BCV 
(BCVspike, SEQ ID NO: 14), HCV (HCVspike, SEQ ID NO: 15) and HEV 
(HEVspike, SEQ ID NO: 16) spike cDNAs. The Y at position 3531 in the 
10 CRCV sequence refers to either C or T. 

Figure 10 

CLUSTAL X (1.8) multiple sequence alignment of the 1363 amino acid 
sequence of the CRCV spike protein (CRCVspikepr, SEQ ID NO: 4) with 
BCV (BCVspikepro, SEQ ID NO: 17), HCV (HCVspikepro, SEQ ID NO: 
15 18), HEV (HEVspikepro, SEQ ID NO: 19) and enteric CCV 
(CECVspikepr, SEQ ID NO: 20) spike proteins. 

Figure 11 

RT-PCR using nested set of primers (Spike 1 and 2 (SEQ ID NOS: 34 and 
35) followed by Spike 3 and 4 (SEQ ID NOS: 36 and 37)). BCV: Bovine 
20 coronavirus positive control sample; A72: Coronavirus negative A72 cells; 
H 2 0: PCR mix without DNA; T5 - T21: Tracheal samples of study dogs. 
The agarose gel electrophoresis shows PCR products of the expected size of 
442bp for the positive control (BCV) and samples T12 and T21. 
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Figure 12 

Comparison of the prevalence of respiratory disease in two groups of dogs. 
Dogs in group 1 were positive for serum antibodies to respiratory 
coronavirus on day of entry into the kennel, dogs in group 2 were negative. 
5 The graph shows the percentage of dogs developing respiratory disease in 
group 1 compared to group2 (p<0.001). n is the total number of dogs in 
each group. 

Figure 13 

Partial nucleotide sequence (497 residues) of the CRCV 
10 hemagglutinin/esterase (HE) gene (SEQ ID NO: 21). The sequence 
corresponds to nucleotides 418 to 914 of the HE genes of BCV (GenBank 
M84486) and HCV OC43 (GenBank Accession No. M76373). 

Figure 14 

Partial amino acid sequence (165 residues) of the CRCV HE protein (SEQ 
15 ID NO: 22), derived from the nucleotide sequence of Figure 13. This 
sequence corresponds to amino acid residues 140 to 304 of BCV (GenBank 
M84486) and HCV OC43(GenBank Accession No. M76373). 

Figure 15 

CLUSTAL X (1.8) multiple sequence alignment of a 497 nucleotide partial 
20 sequence of the hemagglutinin/esterase (HE) gene of CRCV (canine 
respiratory coronavirus, SEQ ID NO: 21) with BCV (bovine coronavirus 
strain LY138, (SEQ ID NO: 23, taken from Genbank Accession No. 
AF058942), OC43 (human coronavirus strain OC43, SEQ ID NO: 24 taken 
from Genbank Accession No. M76373), HECV (human enteric coronavirus, 
25 SEQ ID NO: 25, taken from Genbank Accession No. L07747), and HEV 
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(hemagglutinating encephalomyelitis virus, SEQ ID NO: 26, taken from 
Genbank Accession No. AF481 863). 



Figure 16 

CLUSTAL X (1.8) multiple sequence alignment of a 165 amino acid partial 
5 sequence of the HE protein of CRCV (canine respiratory coronavirus, (SEQ 
ID NO: 22) with BCV (bovine coronavirus strain LY138, SEQ ID NO: 27, 
taken from Genbank Accession No. AF058942), OC43 (human coronavirus 
strain OC43, SEQ ID NO: 28, taken from Genbank Accession No. 
M76373), HECV (human enteric coronavirus, SEQ ID NO: 29, taken from 
10 Genbank Accession No. L07747), and HEV (hemagglutinating 
encephalomyelitis virus, SEQ ID NO: 30, taken from Genbank Accession 
No. AF481863). The three CRCV-specific amino acids F, N and L are 
indicated in bold and are underlined. 



Figure 17 

15 RT-PCR using consensus primers HE1 (SEQ ID NO: 38) and HE2 (SEQ ID 
NO: 39) directed to the HE gene of BCV and HCV (strain OC43). The 
agarose gel electrophoresis shows a PCR product of the expected size of 
497bp for the BCV positive control and for four tracheal samples from 
study dogs (T90, T91, T101 and T105), and not for coronavirus-negative 

20 A72 cells or the PCR mix without DNA (H 2 0). 1 kb indicates a molecular 
size standard (Promega). 

Figure 18 



CRCV Spike gene cloning strategy. 
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Example 1: Detection of a novel coronavirus associated with canine 
infectious respiratory disease 

Summary 

An investigation into the causes of canine infectious respiratory disease 
5 (CIRD) was carried out in a large re-homing kennel. * Tissue samples taken 
from the respiratory tract of diseased dogs were tested for the presence of 
coronaviruses using RT-PCR with conserved primers for the polymerase 
gene. Sequence analysis of four positive samples showed the presence of a 
novel coronavirus with high similarity to both bovine and human 
10 coronavirus (strain OC43) in their polymerase and spike genes whereas 
there was a low similarity to comparable genes in the enteric canine 
coronavirus. This canine respiratory coronavirus (CRCV) was detected by 
RT-PCR in 32/119 tracheal and 20/119 lung samples with the highest 
prevalence being detected in dogs with mild clinical symptoms. Serological 
15 analysis showed that the presence of antibodies against CRCV on the day of 
entry into the kennel decreased the risk of developing respiratory disease. 

Materials and Methods 

Study population 

Dogs from a well-established re-homing kennel with a history of endemic 
20 respiratory disease were monitored for this study. On entry into the kennel, 
all dogs were vaccinated with KAVAK DA 2 PiP69 (Fort Dodge) a live 
attenuated vaccine for distemper virus, canine adenovirus type 2, canine 
parainfluenzavirus and canine parvovirus. Also, a killed leptospirosis 
vaccine was used (Fort Dodge). The health status of each dog was assessed 
25 twice a day by a veterinary clinician and the respiratory symptoms were 
graded as follows: 1: no respiratory signs, 2: mild cough, 3: cough and nasal 
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discharge, 4: cough, nasal discharge and inappetence, 5: 
bronchopneumonia. The overall health status of the dogs was graded as 
follows: 1 : good health, 2: poor health, 3: very poor health. The age, breed 
and sex of the dogs were recorded. 

5 For 119 dogs a full post mortem examination was performed. The tissue 
samples were stored at -70°C until further use. 

Serum samples were collected from 111 dogs on day of entry into the re- 
homing kennel. For 81 dogs a follow-up serum was available on day 7 and 
for 1 1 1 dogs a serum was available on day 21 after entry. 

10 Of the 111 dogs, 30 remained healthy during the 21 days between the first 
and the last serum sample whereas 81 dogs developed respiratory disease. 

Sera from 35 dogs housed elsewhere were obtained from the diagnostic 
service of the Royal Veterinary College. These sera had been submitted for 
biochemical analysis for various reasons. Five of these sera were from 18- 
15 month-old beagles with no history of respiratory disease. Sera were 
routinely stored at-20°C. 

RNA extraction and RT-PCR 

RNA was extracted from tracheal and lung tissue of 119 dogs using 
TriReagent (Sigma). Approximately 25-50 mg of homogenised tissue was 
20 used and RNA was extracted as recommended by the manufacturer. 

Synthesis of cDNA was performed using Random Hexamers (Roche) and 
ImPromll reverse transcriptase (Promega). 

The polymerase gene of coronaviruses is known to be highly conserved, and 
has previously been used for phylogenetic analysis of this virus family 
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(Stephensen et ah, 1999). For the detection of coronaviruses a modification 
of the primers 2Bp and 4Bm directed against the polymerase gene as 
described by Stephensen et al (1999) were used (ConscoroS: 5' -ACT- 
CAR- ATG-AAT-TTG-AAA-TAT-GC (SEQ ID NO: 31); and Conscoro6: 
5 5' -TCA-CAC-TTA-GGA-TAR-TCC-CA (SEQ ID NO: 32)). 

PGR was performed using Taq polymerase (Promega) in the provided 
reaction buffer containing a final concentration of 2.5 mM MgCl 2 and 
0.5|llM of primers. For PCR with the primers ConscoroS and Conscoro6 the 
following temperature profile was used: After denaturation at 95°C for 5 
10 min 5 10 cycles were carried out at 95°C for 1 min, annealing at 37°C for 1 
min and extension at 72°C for lmin. This was followed by 10 cycles using 
an annealing temperature of 45°C, 10 cycles at an annealing temperature of 
50°C and 10 cycles at an annealing temperature of 53°C followed by a final 
extension at 72° C for 10 min. 

15 A 20^1 fraction of the PCR product was analysed on a 1.5% agarose gel and 
blotted onto a nylon membrane (Roche) after electrophoresis. The nylon 
membrane was hybridised with an oligonucleotide probe specific for the 
PCR product at 37°C overnight (Probe Conscoro: AAG-TTT-TAT-GGY- 
GGY-TGG-GA (SEQ ID NO: 33)). The probe was 3'A-tailed with 

20 Digoxigenin-dUTP and was detected using anti-Digoxigenin conjugate and 
CSPD chemoluminescent substrate (Roche). 

Primer sequences specific for the spike gene were derived from an 
alignment of the spike region of bovine coronavirus strain LY-138 
(AF058942) and human coronavirus strain OC43 (LI 4643). 

25 A PCR was performed with the primers Spike 1 and Spike 2, followed by a 
nested PCR using the primers Spike 3 and Spike 4 and 2\x\ of the product of 
the first amplification. 
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The numbers in brackets refer to the nucleotide position in the bovine 
coronavirus genome. 

Spike 1: 5 5 -CTT-ATA-AGT-GCC-CCC-AAA-CTA^AAT (25291-25314) 
Spike 2: 5' -CCT-ACT-GTG-AGA-TCA-CAT-GTT-TG (25912-25890) 
5 Spike 3: 5 5 -GTT-GGC-ATA-GGT-GAG-CAC-CTG (25320-25339) 
Spike 4: 5 5 -GCA-ATG-CTG-GTT-CGG-AAG-AG (25762-25742) 

Oligonucleotide Spike 1 has SEQ ID NO: 34, Spike 2 has SEQ ID NO: 35, 
Spike 3 has SEQ ID NO: 36, Spike 4 has SEQ ID NO: 37. 

The temperature profile used was denaturation at 95 °C for 5 min, followed 
10 by 35 cycles of denaturation at 95°C for 1 min, annealing at 55°C for 40 sec 
and elongation at 72°C for 1 min. The final extension was performed at 
72°C for 10 min. The nested PCR produced a 442bp fragment. 

PCR products were cloned into the pGEM-T-easy vector (Promega) and 
sequenced using the Thermo sequenase fluorescent labelled primer cycle 
15 sequencing kit with 7-deaza-dGTP (Amersham Pharmacia) using Cy5 
labelled primers. 

Phylogenetic analysis 

An alignment of the 250 bp cDNA sequence from the polymerase gene to 
the corresponding sequences of 11 coronaviruses was performed using 
20 ClustalX (Thompson et al, 1997). 

The phylogenetic relationship to known coronaviruses was analysed using 
the Phylip 3.6 package (Felsenstein, 1989). The alignments were followed 
by a bootstrap analysis using the Seqboot programme. The obtained data 
sets were used for a maximum parsimony analysis using the DNApars 
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programme and a consensus tree was calculated using Consense. The 
resulting trees were drawn using the Treeview programme (Page, 1996). 

ELISA 

ELISA antigen for bovine coronavirus or enteric canine coronavirus 
5 (CECV) (the antigens are a preparation from virus infected cell cultures 
obtained from Churchill Applied Biosciences, Huntingdon, UK) was 
resuspended in PBS at the concentration recommended by the manufacturer 
and incubated on 96 well plates (Falcon) overnight at 37°C. 

The plates were washed with PBS and blocked with PBS containing 5% 
10 skimmed milk powder for 30 min. The sera were diluted 1:100 in blocking 
buffer and incubated on the plates for lh. After washing with PBS/ 0.05% 
Tween 20 (Sigma), a peroxidase labelled rabbit anti-dog IgG conjugate 
(Sigma) was added (1:5000 in PBS/0.05% Tween 20) for 1 h. The plates 
were incubated with colour substrate (OPD, Sigma) for 10 min and the 
15 reaction was stopped by adding 2M H 2 S0 4 . The adsorption was determined 
in an ELISA photometer at 492nm. 

Virus culture 

Virus isolation is performed on canine adult lung fibroblasts (passage 3 to 
7), MDCK and A72 cells. (It is appreciated, however, that virus isolation 

20 could be performed using primary cells or cell lines such as MDCK or A72 
(canine), MDBK (bovine), HRT-1 8 (human rectal tumour cell line) and 
Vero (African Green Monkey). The lung fibroblasts are maintained in 
MEM with 20% fetal calf serum (FCS), MDCK and A72 cells are 
maintained in MEM with 5% FCS. Tracheal tissue samples (approx. 25mg) 

25 are homogenised using a scalpel and mixed vigorously in 1ml MEM 
containing Penicillin (lOOU/ml), Streptomycin (O.lmg/ml), Amphotericin B 
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(2.5(ig/ml) and Trypsin (lfig/ml). The samples are centrifuged at 13000 
rpm for 10 min. and the supernatant is used to inoculate cell cultures. After 
30 min. at 37°C the supernatant is removed and maintenance medium added 
to the cultures. The cultures are passaged three times in the absence of a 
5 cytopathic effect. Then, RNA is extracted from the cells and RT-PCR to 
detect the presence of CRCV is performed. 

Statistical analysis 

The data were analysed using the chi-square test or Fisher's exact test and p 
values below 0.05 were considered statistically significant. 

10 Results 

PCR using consensus primers for the coronavirus RNA polymerase 
gene 

Using the primers ConscoroS and Conscoro6, cDNA obtained from 40 
tracheal samples was analysed by RT-PCR. 

15 Out of these, seven were found to be positive by PCR and subsequent 
hybridisation (17.5%). 

The PCR products were cloned and sequenced (Figures 1 and 2) and the 
sequence data were compared to available viral sequences using the FASTA 
search program (Pearson, 1990). 

20 Comparison of the coronavirus cDNA polymerase sequence obtained from 
four of the canine tracheal samples to other coronavirus sequences revealed 
that they were most similar to sequence data from BCV strain Quebec and 
LY138 (Genbank Accession Nos. AF220295 and AF058942, respectively) 
and human coronavirus strain OC43 (Genbank Accession No. AF124989). 
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The similarity in the analysed 250 bp sequence was 98.8% for BCV 
Quebec, and 98.4% for BCV LY138 and the HCV pol genes, whereas it was 
only 68.53% for CCV strain 1-71 pol gene (Figures 6 and 7). 

An alignment of the novel sequence with the corresponding sequences of 11 
5 coronaviruses and phylogenetic analysis using the maximum parsimony 
method resulted in the consensus tree shown in Figure 5. The cDNA 
sequence obtained from a tracheal sample (T101) was found on a common 
branch with bovine coronavirus, human coronavirus-OC43 and 
hemagglutinating encephalomyelitis virus. 

10 The virus was called canine respiratory coronavirus (CRCV). 

PCR using primers for the spike gene 

For further analysis of the RNA sequence of CRCV, an alignment of the 
RNA for the spike gene of the bovine coronavirus LY 138 strain 
(AF058942) and the human coronavirus OC43 strain (LI 4643) was 
15 performed using Clustal X (Thompson et aL, 1997). Consensus regions 
were chosen for the selection of the nested primer sets Spike 1-2 and Spike 
3-4 (Figure 11). PCR analysis was performed with the cDNA obtained 
from 119 tracheal and lung samples using these nested primers. 

In total 32 tracheal samples (26.9%) and 20 lung samples (16.8%) were 
20 found positive by nested PCR. For eight dogs a positive PCR result was 
obtained for both, trachea and lung. 

Sequence analysis of the PCR products obtained from tissues of six 
different dogs showed identical DNA sequences for these cDNAs (Figures 3 
and 4). A comparison to known coronavirus spike sequences using the 
25 FASTA program revealed a 98.1% similarity to bovine coronavirus and a 
97.8% similarity to human coronavirus OC43 (Figures 9 and 10). 
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PCR using primers for the HE gene 

Bovine coronavirus and other gorup II coronaviruses contain an additional 
structural protein, the hemagglutinin/esterase (HE). Because of the high 
similarity of CRCV with BCV, we analysed the presence of an HE gene in 
CRCV. 

An alignment of the HE genes sequences of BCV and HCv OC43 was used 
to design the primers HE1 and HE2 (Table 2). Four tracheal samples that 
had previously been identified as positive for coronavirus RNA by RT-PCR 
with primers for the S gene were tested by RT-PCR with the primer set for 
the HE gene. All four samples showed a PCR band of the expected size 
after agarose gel electrophoresis (Figure 17). 



Table 2: Primers designed from an alignment of the 
hemagglutinin/esterase genes of BCV (GenBank Accession No. 
M84486) and HCV OC43 (GenBank Accession No. M76373) 



Name 


Sequence 


Location in BCV 
HE gene 


HE 1 


5 ' -TAT-CGC-AGC-CTT-ACT-TTT-GT 


418-437 


HE 2 


5 ' -ACC-GCC-GTC-ATG-TTA-TC A-G 


914-896 



Primer HE1 has SEQ ID No: 38 and HE2 has SEQ ID No: 39. The 
sequence of the CRCV PCR product obtained using primers HE 1 and HE 2 
is given in Figure 13 (SEQ ID No: 21), and its predicted amino acid 
sequence is listed in Figure 14 (SEQ ID No: 22). A comparison of these 
nucleotide and amino acid sequences with the corresponding fragments of 
other related coronaviruses is shown in Figures 15 and 16. Three amino 
acids were shown to be unique to CRCV, as shown in Table 3. 
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Table 3: Unique amino acids in CRCV HE gene 



Amino acid 
in CRCV 


Amino acid in BCV/ 
HECV/HCV/HEV 


Position in BCV/ 
HECV/HCV/HEV 


Position in PCR 
product HE1-HE2 


F(Phe) 


L (Leu) 


235 


96 


N (Asn) 


T(Thr) 


242 


103 


L (Leu) 


V (Val) 


253 


114 



The amino acid positions in BCV, HECV, HCV and HEV are numbered from 
the initial M (which is number 1) at the start of the BCV and HCV OC43 
HE proteins (GenBank Accession Nos. M84486 and M76373, respectively). 



5 Association of PCR positive samples with respiratory signs 

Using primers for the spike gene, tracheal and lung samples from 119 dogs 
were analysed by RT-PCR for CRCV. Of these 42 were from dogs with no 
respiratory signs (grade 1), 18 dogs had shown mild respiratory signs (grade 
2), 46 had shown moderate (grade 3) and 13 severe respiratory signs (grades 
10 4 and 5). Grades 4 and 5 were merged due to the low case numbers in these 
groups. 

Table 4 shows the PCR results for coronavirus in dogs with different grades 
of respiratory disease. Specifically, Table 4 shows the RT-PCR results 
from tracheal and lung samples of 1 19 dogs with different respiratory signs 
15 (none to severe) using a nested PCR directed against the coronavirus spike 
gene as well as the number of positive samples out of total sample number 
and the percentage of positive samples (in brackets). 
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Table 4: RTVPCR results for tracheal and lung samples 



Respiratory signs 


Trachea: 
Positive samples 


Lung 

Positive samples 


Trachea and lung 
Positive samples 


None 


11/42(26.2%) 


8/42(19.1%) 


2/42 


Mild 


10/18 (55.6%) 


4/18(22.2%) 


4/18 


Moderate 


9/46 (19.6%) 


8/46 (17.4%) 


2/46 


Severe 


2/13 (15.4%) 


0/13 


0/13 



Establishment of a serological assay for CRCV 

Because of the homology of the spike cDNA of CRCV to the spike region 
5 of bovine coronavirus, an ELISA antigen for BCV was used for serological 
analysis of CRCV. 

Sera from five dogs with no history of infectious respiratory disease that 
had not been housed in the investigated kennel were tested. The OD values 
ranged from -0.013 to 0.39 with an average OD value of 0.154. 
10 Furthermore, sera from 30 dogs admitted to a veterinary clinic for various 
reasons were tested for antibodies to coronavirus. Of these, 20 samples 
showed an OD of <0.4 (-0.46 to 0.396) and 10 samples showed an OD of 
>1.0 (1.012 to 1.949). Samples with an OD of 0.6 or above were 
subsequently considered positive. 
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Comparison of the immune response to CRCV of dogs with and 
without respiratory disease 

The BCV-antigen ELISA was performed using paired sera of 111 dogs 
from the study kennel. Of these, 81 dogs had shown symptoms of 
5 respiratory disease during a period of 21 days and 30 had remained healthy. 

Of the group of dogs with respiratory disease, 17 were positive for 
antibodies to CRCV on the day of entry into the kennel and 64 were 
negative. 

Of the 64 dogs with no detectable antibodies to BCV on day one, 63 tested 
10 positive on day 21 . All 46 dogs out of these 63 for which a sample on day 7 
was available tested negative on day 7. Therefore 63 dogs showed a 
seroconversion during the study-period whereas only one dog remained 
negative. 

Of the 31 dogs that had remained healthy, 17 had antibodies to CRCV on 
15 the day of entry. All of the 13 dogs that were negative on day 1 tested 
negative on day 7 but showed a seroconversion by day 21. 

Thus, of 34 dogs that were positive for antibodies to CRCV on arrival in the 
kennel, 17 developed respiratory disease (50%) whereas of 77 dogs that 
were negative on arrival, 64 developed respiratory signs during the study- 
20 period (83.1%), (Figure 12). 

Therefore dogs that had no antibodies to CRCV on entry into the kennel had 
an increased probability of developing respiratory disease (p<0.001). 

Only one out of the 77 dogs that were negative on arrival remained negative 
during the study period of 21 days whereas 76 dogs showed a 
25 seroconversion. 
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Serology using canine enteric coronavirus (CECV) antigen 

An ELISA assay using a canine coronavirus antigen was performed to 
investigate whether CRCV showed a serological cross reaction to canine 
enteric coronavirus. Sera from 27 dogs, previously tested for antibodies to 
5 CRCV using the BCV antigen were selected. 

It was found that eight dogs had antibodies to CECV on the day of entry 
into the kennel, of these four also had antibodies to CRCV. Nineteen dogs 
were found to be negative for CECV on day 1, 17 of these were also 
negative for CRCV. Of the 19 negative dogs, five showed a seroconversion 
10 to CECV during the 21 -day period of the investigation and 17 showed a 
seroconversion to CRCV. 

Analysis of the prevalence of respiratory disease in this group showed that 
six out of the eight dogs (75%) that were positive for antibodies to CECV 
on day 1 developed respiratory disease. Out of the group of 19 dogs that had 
15 no detectable antibodies to CECV on day 1, 15 showed signs of respiratory 
disease (78.9%), (p=0.S94). 

Virus isolation 

Tracheal tissue samples from dogs that are identified as positive for CRCV 
RNA by RT-PCR are inoculated on cell cultures of canine adult lung 
20 fibroblasts and MDCK cells. For some samples, virus isolation is also 
performed on A72 cells. The cultures show no signs of a cytopathic effect 
during three passages. After several passage, RNA is extracted from the 
cultures and tested for the presence of CRCV RNA by RT-PCR. 
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Discussion 

This study reports the detection of a novel coronavirus, CRCV, in kennelled 
dogs with respiratory disease. 

Coronaviruses have been reported to cause respiratory disease of man, 
5 cattle, swine and poultry, but their presence in the respiratory tract of dogs 
and a possible association with canine infectious respiratory disease (CIRD) 
has not been determined. 

Dogs were investigated from a kennel in which CIRD was endemic and 
could not be controlled by the use of vaccines recommended against CIRD. 
10 Samples taken from the respiratory tract of these dogs were examined using 
RT-PCR primers directed to the conserved polymerase gene of 
coronaviruses (Stephensen et aL, 1999). 

Initially, seven tracheal samples were found to be positive; the sequence of 
the RT-PCR products was determined and compared to all available 
15 coronavirus polymerase gene sequences. This analysis revealed that the 
cDNA sequence obtained from the canine samples had the highest similarity 
to the polymerase gene of bovine coronavirus (98.8%) and human 
coronavirus OC43 (98.4%) but only a very low similarity to the polymerase 
gene of the enteric canine coronavirus (strain 1-71, 68.53% similarity). 

20 A phylogenetic analysis was performed using the polymerase sequences of 
eleven additional coronaviruses. The coronavirus detected in the respiratory 
tract of dogs (CRCV) was located on a common branch with three group 2 
viruses: BCV, HCV strain OC43 and HEV. However, canine enteric 
coronavirus, a group 1 coronavirus, was shown to be only distantly related. 
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Canine respiratory coronavirus therefore is a novel coronavirus of dogs that 
is most closely related to BCV and HCV-OC43, both of which are known to 
cause respiratory disease. 

To obtain more sequence information and to further determine the 
5 relationship to other coronaviruses using a more variable gene, a part of the 
spike gene was analysed. Since CRCV had been shown to be most similar, 
to BCV and HCV-OC43, an alignment of the sequences of their spike genes 
was used to design a nested set of primers. Nested primers were chosen to 
achieve a more sensitive assay. 

10 Sequencing of the products of this RT-PCR confirmed the high similarity of 
CRCV with BCV and HCV-OC43. 

The presence of antibodies to CRCV was analysed using an ELISA based 
on a BCV antigen because of the high sequence similarity of the two viruses 
in the spike cDNA. The ELISA results confirmed the presence of a virus 
15 similar to BCV in the study population. 

The prevalence of antibodies was 30% at the time of entry into the kennel 
and 99% after 21 days. 

Interestingly and unexpectedly, serological analysis revealed that dogs with 
antibodies to CRCV on day of entry into the kennel developed respiratory 
20 disease less frequently than dogs without antibodies (pO.OOl). Therefore 
the presence of antibodies to CRCV had a protective effect against 
respiratory disease in this population. 

Almost all dogs negative on day of entry into the kennel showed a 
seroconversion to CRCV within three weeks, indicating that the virus is 
25 highly contagious. Serology using an antigen for canine enteric coronavirus 
(CECV) showed a much lower prevalence of antibodies to CECV on day 
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21. Therefore the BCV-ELISA results did not reflect an infection with 
canine enteric coronavirus and the cross-reactivity between the two antigens 
seems to be low. 

Serum antibodies to CRCV were present in about 30% of dogs of various 
5 origins including dogs entering a re-homing kennel as well as pet dogs. The 
presence of CRCV is therefore not limited to the investigated kennel and the 
virus seems to be established in the dog population. 

By PCR, CRCV was detected in tracheal tissue and lung tissue and 
therefore appears to infect the upper and lower respiratory tract of dogs. 
10 Within the kennelled population, CRCV-RNA was detected in 27.3% of 
dogs with all grades of respiratory disease as well as in 26.2% of dogs that 
were apparently healthy at the time of euthanasia. 

CRCV-RNA was most frequently found in the trachea of dogs with mild 
cough (55%). Studies using the human coronavirus strain 229E have shown, 

15 that coronaviruses can cause disruption of the respiratory epithelium and . 
ciliary dyskinesia (Chilvers et aL,2001). Without being bound by theory, 
we believe that an infection with CRCV has a similar effect, and that the 
virus plays an important role in the early stages of the pathogenesis of 
CIRD. By damaging the respiratory epithelium and disrupting ciliary 

20 clearance CRCV facilitates the entry of other viral or bacterial pathogens. 
Therefore while CRCV infection on its own may cause only mild 
respiratory symptoms, in conjunction with other pathogenic agents it could 
lead to severe respiratory disease. 

The pathogenesis of CIRD has not been thoroughly investigated since the 
25 1970s when Bordetella bronchiseptica, canine adenovirus type 2 and canine 
parainfluenza were determined to be the main causes of the disease. 
However the vaccination of all dogs against CPIV, CAV-2 and distemper 
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virus did not help to control the disease in this kennel despite evidence that 
the majority of dogs responded to the vaccine within 21 days (data not 
shown). 

This study shows an association of a novel canine respiratory coronavirus 
5 with CIRD. The aetiology of CIRD therefore needs to be re-evaluated and 
the role of novel microorganisms or microorganisms previously not 
associated with the disease has to be established. 
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Example 2: Cloning and expression of CRCV Spike 

The CRCV Spike gene was cloned using the primers listed in Table 5 and 
using the following cloning strategy, which is illustrated in Figure 18. 

1. The spike gene was amplified in four overlapping fragments 
5 (A,B,C,D). 

2. The PCR product Sp5-Sp2 (B) was joined to the product Spl-Sp8 (C) 
using the Pvull site in the overlap. 

3. This fragment was cloned into the pT7blue2 vector (Novagen) using 
the restriction sites Ncol andifa^XL 

10 4. The PCR fragment SpFXho-Sp6 (A) was joined to BC using the 
restriction site BstXl in the overlap and the XJiol site that had been 
incorporated into the primer SpF-Xho. 

5. Fragment ABC was moved into the baculovirus transfer vector 
pMelBacB (Invitrogen) using the restriction sites XIiol and Ncol. 

15 6. The PCR fragment Sp7-SpR-HisTag- Eco (D) was joined to ABC 
using the restriction site Ncol in the overlap and the EcoKL site that 
had been incorporated into the primer SpR-Eco-HisTag resulting in the 
complete spike gene in pMelBacB (Spike MelBac). This construct 
contains a HisTag (6xHis) at the C terminus of the expressed protein. 

20 7. For mammalian expression the complete gene was moved to 
pSecTagA (Invitrogen) using the BamUl site in pMelBacB and the 
EcoKl site at the end of ABCD resulting in the plasmid SpikeSecTag. 
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Construction of a recombinant baculovirus 

A co-transfection was performed in Sf9 cells using the Bac-N-Blue 
baculovirus DNA (Invitrogen) and Spike MelBac. The resulting 
baculovirus (AcSpCRCV 1-11) was shown to contain a full-length insert by 
PCR using primers (Invitrogen) located upstream and downstream of the 
recombination site. 

Expression in mammalian cells 

The plasmid Spike SecTag was transfected into BHK-21 cells using 
Lipofectamine (Invitrogen). Expression of the Spike protein was analysed 
using a serum sample from a dog that had been shown to be positive for 
antibodies to CRCV using ELISA (BCV antigen obtained from Churchill) 
and a positive control serum for BCV obtained from Churchill (chicken anti 
BCV). The transfected cells showed a positive signal in an 
immunofluorescence assay using the canine or the chicken serum and a 
FITC labelled conjugate (FITC anti-dog IgG or FITC anti Chicken IgG). 
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CLAIMS 

1 . A coronavirus Spike (S) protein, or fragment thereof, having at least 
75% amino acid sequence identity with the CRCV S protein whose amino 

5 acid sequence is listed in Figure 4, and comprising at least one of the canine 
respiratory coronavirus (CRCV)-specific amino acids listed in Table 1 . 

2. A coronavirus S protein that comprises the amino acid sequence 
listed in Figure 4, or a variant thereof with at least 97% amino acid 

10 sequence identity with the sequence listed in Figure 4. 

3. A coronavirus polymerase (pol) protein, or fragment thereof, having 
at least 90% amino acid sequence identity with BCV pol protein and 
comprising the amino acid E at the position corresponding to position 4975 

15 in the BCV genome (Accession No. SWALL: Q91A29). 

4. A coronavirus pol protein that comprises the amino acid sequence 
listed in Figure 2. 

20 5. A coronavirus hemagglutinin/esterase (HE) protein, or fragment 
thereof, having at least 90% amino acid sequence identity with BCV HE 
protein and comprising one or more of F at position 235, N at position 242 
and L at position 253 of the BCV HE protein (Genbank Accession No. 
AF058942). 

25 

6. A coronavirus HE protein that comprises the amino acid sequence 
listed in Figure 14. 
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7. A polynucleotide that encodes the protein according to any of Claims 
1 to 6, or the complement thereof. 

8. A polynucleotide according to Claim 7 comprising the nucleotide 
5 sequence listed in Figure 3 . 

9. A polynucleotide according to Claim 7 comprising the nucleotide 
sequence listed in Figure 1 . 

10 10. A polynucleotide according to Claim 7 comprising the nucleotide 
sequence listed in Figure 13. 

11. A vector comprising the polynucleotide of any of Claims 7 to 10. 
15 12. A host cell comprising the vector of Claim 1 1 . 

13. A method of obtaining a protein encoded by the vector of Claim 11, 
the method comprising culturing the host cell of Claim 12, expressing the 
protein in the host cell, and purifying the protein. 

20 

14. A protein obtainable by the method of Claim 1 3 . 

15. A host cell according to Claim 12 wherein the vector is an expression 
vector comprising a eukaryotic promoter operatively linked to the 

25 polynucleotide, and wherein the host cell is a eukaryotic host cell. 

16. A method of obtaining a glycosylated S protein encoded by the 
polynucleotide of Claim 7 or 8, the method comprising culturing the host 
cell of Claim 15, expressing the protein in the host cell, and purifying the 

30 protein. 
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17. A glycosylated S protein obtainable by the method of Claim 16. 

18. A method of making an anti-CRCV antibody comprising (i) raising 
5 an immune response to an S protein according to any of Claims 1,2, 14 or 

17 in an animal and preparing an antibody from the animal or from an 
immortal cell derived therefrom, or (ii) selecting an antibody from an 
antibody-display library using an S protein according to any of Claims 1 , 2, 
14 or 17. 

10 

19. A method according to Claim 18 further comprising determining 
whether the antibody has greater affinity for the CRCV S protein than for 
the BCV S protein. 

15 20. An anti-CRCV antibody obtainable by the method of Claim 18 or 19 
, that has greater affinity for the CRCV S protein than for the BCV S 
protein. 

21 . A method of making an anti-CRCV antibody comprising (i) raising 
20 an immune response to an HE protein according to Claim 5 or 6 in an 
animal and preparing an antibody from the animal or from an immortal cell 
derived therefrom, or (ii) selecting an antibody from an antibody-display 
library using an HE protein according to Claim 5 or 6. 

25 22. A method according to Claim 21 further comprising determining 
whether the antibody has greater affinity for the CRCV HE protein than for 
the BCV HE protein. 
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23 . An anti-CRCV antibody obtainable by the method of Claim 2 1 or 22, 
that has greater affinity for the CRCV HE protein than for the BCV HE 
protein. 

24. A method of determining whether a dog has been exposed to CRCV, 
the method comprising: 

(a) obtaining a suitable sample from the dog; and 

(b) identifying CRCV or an anti-CRCV antibody in the sample. 

25. A method according to Claim 24 or 25 wherein the anti-CRCV 
antibody is detected using a CRCV, BCV, human coronavirus (HCV) or 
hemagglutinating encephalomyelitis virus (HEV) antigen. 

26. A method according to Claim 25 wherein the suitable sample is an 
antibody containing sample such as serum, saliva, tracheal wash and 
bronchiolar lavage, and wherein identifying an anti-CRCV antibody 
comprises identifying an antibody that selectively binds to BCV S protein 
(AF058942) HCV S protein (LI 4643), or to a coronavirus having an S 
protein with at least 75% amino acid identity with CRCV S protein (Figure 
4) or a fragment thereof. 

27. A method according to Claim 25 wherein identifying an anti-CRCV 
antibody comprises identifying an antibody that selectively binds to BCV 
HE protein (AF058942) HCV HE protein (M76373), or to a coronavirus 
having an HE protein with at least 90% amino acid identity with CRCV HE 
protein (Figure 14) or a fragment thereof. 

28. A method according to Claim 24 wherein the suitable sample is an 
antibody containing sample, and wherein identifying an anti-CRCV 
antibody comprises identifying an antibody that selectively binds to an S 
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protein according to any of Claims 1, 2, 11 or 14 or to an HE protein 
according to Claim 5 or 6. 

29. A method according to Claim 24 wherein the suitable sample is a 
5 lung wash, tracheal wash, tonsilar swab or a biopsy or post-mortem sample 

from the respiratory tract of the dog. 

30. A method according to Claim 29 wherein identifying CRCV 
comprises identifying a nucleic acid component of CRCV. 

10 

31. A method according to Claim 30 wherein identifying a nucleic acid 
component of CRCV comprises identifying a polynucleotide that hybridises 
at high stringency to the BCV genome (AF058942). 

15 32. A method according to Claim 30 or 31 wherein identifying CRCV 
comprises identifying a polynucleotide as defined in any of Claims 7 to 10. 

33. A method according to Claim 29 wherein identifying CRCV 
comprises identifying a protein component of CRCV. 

20 

34. A method according to Claim 33 wherein identifying a protein 
component of CRCV comprises identifying a protein according to any of 
Claims 1, 2, 5, 6, 1 1 or 14. 

25 35. A method according to Claim 33 or 34 wherein identifying a protein 
component of CRCV comprises using an antibody reactive with CRCV. 

36. A method according to Claim 35 wherein the antibody reactive with 
CRCV is an anti*BCV antibody, an anti-HCV antibody, or an anti-CRCV 
30 antibody according to Claim 20 or 23. 
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37. An immunosorbent assay for detecting anti-CRCV S antibodies, the 
assay comprising: 

a solid phase coated with an S protein according to any of Claims 1, 
5 2, 14 or 17, or an antigenic fragment thereof, wherein anti-CRCV S 
antibodies in a sample exposed to the solid phase will bind to the protein; 
and 

a detectable label conjugate which will bind to the anti-CRCV 
antibodies bound to the solid phase. 

10 

38. An immunosorbent assay for detecting anti-CRCV HE antibodies, 
the assay comprising: 

a solid phase coated with an HE protein according to Claim 5 or 6, or 
an antigenic fragment thereof, wherein anti-CRCV HE antibodies in a 
15 sample exposed to the solid phase will bind to the protein; and 

a detectable label conjugate which will bind to the anti-CRCV 
antibodies bound to the solid phase. 

39. An immunosorbent assay according to Claim 37 or 38, wherein the 
20 solid phase is a microtitre well. 

40. An immunosorbent assay according to Claim 37 or 38, wherein the 
conjugate comprises anti-dog antibody. 

25 41. An immunosorbent assay according to any of Claims 37 to 40, 
wherein the conjugate comprises an enzyme. 

42. An immunosorbent assay according to Claim 41, wherein the 
enzyme is horseradish peroxidase. 

30 
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43. An immunosorbent assay according to Claim 41 or 42, further 
comprising a substrate for the enzyme. 

44. A solid substrate with an S protein according to any of Claims 1, 2, 
14 or 17, or an antigenic fragment thereof, attached thereto, for capturing 
anti-CRCV S antibodies from a liquid sample, wherein anti-CRCV S 
antibodies in a sample exposed to the solid substrate will bind to the S 
protein. 

45. A solid substrate with an HE protein according to Claim 5 or 6, or an 
antigenic fragment thereof, attached thereto, for capturing anti-CRCV HE 
antibodies from a liquid sample, wherein anti-CRCV HE antibodies in a 
sample exposed to the solid substrate will bind to the HE protein. 

46. A solid substrate according to Claim 44 or 45, wherein the solid 
substrate is a microtitre well. 

47. A vaccine composition for vaccinating dogs comprising a 
coronavirus having an S protein with at 75% least amino acid identity with 
BCV S protein, or a coronaviral protein having at least 75% amino acid 
identity with a BCV protein or an immunogenic fragment thereof, or a 
nucleic acid encoding said coronaviral protein or immunogenic fraction 
thereof. 

48. A vaccine composition according to Claim 47 wherein the 
coronaviral protein is a BCV, HCV, HEV or CRCV protein, or a 
modification thereof. 

49. A vaccine composition according to Claim 47 or 48 wherein the 
coronaviral protein is an S protein. 
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50. A vaccine composition according to Claim 49 comprising an S 
protein according to any of Claims 1, 2, 14 or 17 , a BCV S protein, an 
HCV S protein, an HEV S protein, or an immunogenic fragment thereof. 

5 

51. A vaccine composition according to Claim 47 or 48 wherein the 
coronaviral protein is a hemagglutinin-esterase protein (HE) or an integral 
membrane protein (M). 

10 52. A vaccine composition according to Claim 51 wherein the HE 
protein comprises an HE protein as defined in Claim 5 or 6, a BCV HE 
protein, an HCV HE protein, or an immunogenic fragment thereof. 

53. A vaccine composition according to Claim 47 wherein the virus is 
15 selected from BCV, HCV, HEV and CRCV, or a modification thereof. 

54. A vaccine composition according to any of Claims 47 to 53 and also 
comprising a pharmaceutically acceptable adjuvant. 

20 55. A vaccine composition according to any of Claims 47 to 55 further 
comprising any one or more of: 

(a) an agent capable of raising an immune response in a dog 
against canine parainfluenza virus (CPIV); 

(b) an agent capable of raising an immune response in a dog 
25 against canine adenovirus type 2 (CAV-2); 

(c) an agent capable of raising an immune response in a dog 
against canine herpesvirus (CHV); and 

(d) an agent capable of raising an immune response in a dog 
against Bordetella bronchiseptica (B. bronchiseptica). 
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56. Use of (i) a coronavirus having an S protein with at least 75% amino 
acid identity with CRCV S protein, or (ii) a coronavirus having an S protein 
with at least 75% amino acid identity with BCV S protein, or (iii) a 
coronavirus having an HE protein with at least 90% amino acid identity 

5 with CRCV HE protein, or (iv) a coronavirus having an HE protein with at 
least 90% amino acid identity with BCV HE protein, or (v) a coronavirus 
protein having at least 75% amino acid identity with a CRCV protein or an 
immunogenic fragment thereof, or (vi) a coronaviral protein having at least 
75% amino acid identity with a BCV protein, or an immunogenic fragment 

io thereof, or (vii) a nucleic acid encoding said coronaviral protein or 
immunogenic fraction thereof, in the preparation of a medicament for 
stimulating an immune response against CRCV in a dog. 

57. Use of (i) a coronavirus having an S protein with at least 75% amino 
15 acid identity with CRCV S protein, or (ii) a coronavirus having an S protein 
with at least 75% amino acid identity with BCV S protein, or (iii) a 
coronavirus having an HE protein with at least 90% amino acid identity 
with CRCV HE protein, or (iv) a coronavirus having an HE protein with at 
least 90% amino acid identity with BCV HE protein, or (v) a coronavirus 
20 protein having at least 75% amino acid identity with a CRCV protein or an 
immunogenic fragment thereof, or (vi) a coronaviral protein having at least 
75% amino acid identity with a BCV protein, or an immunogenic fragment 
thereof, or (vii) a nucleic acid encoding said coronaviral protein or 
immunogenic fraction thereof, in the preparation of a medicament for 
25 prophylaxis of respiratory disease in a dog. 

58. Use according to Claim 56 or 57 wherein the coronaviral protein is a 
BCV, HCV, HEV or CRCV protein, or a modification thereof. 
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59. Use according to any of Claims 56 to 58 wherein the coronaviral 
protein is an S protein. 

60. Use according to Claim 59 wherein the S protein comprises an S 
5 protein according to any of Claims 1 , 2, 14 or 17, a BCV S protein, an HCV 

S protein, or an immunogenic fragment thereof. 

61 Use according to any of Claims 56 to 58 wherein the coronaviral 
protein is HE or M. 

10 

62 Use according to Claim 61 wherein the HE protein comprises an HE 
protein according to Claim 5 or 6, a BCV HE protein, an HCV HE protein, 
or an immunogenic fragment thereof. 

15 63. Use according to Claim 56 wherein the virus is selected from BCV, 
HCV, HEV and CRCV, or a modification thereof. 

64. An S protein according to any of Claims 1, 2, 14 or 17 for use in 
medicine. 

20 

65. An HE protein according to Claim 5 or 6 for use in medicine. 

66. A method of vaccinating a dog against CRCV, the method 
comprising administering to the dog a vaccine composition according to any 

25 of Claims 47to 54. 

67. A method for combating the spread of CRCV between dogs 
comprising determining whether a dog is infected with CRCV according to 
the method of any of Claims 24 to 36 and, if the dog is infected with CRCV, 

30 quarantining the dog. 
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68. A method for combating the spread of CRCV between dogs 
comprising determining whether a dog is infected with CRCV according to 
the method of any of Claims 24 to 36 and, if the dog is infected with CRCV, 

5 vaccinating other dogs that have been, are, or are likely to be in contact with 
the dog. 

69. A method for identifying a test vaccine capable of preventing canine 
infectious respiratory disease (CIRD) in dogs, comprising 

10 (a) determining whether a dog has been exposed to CRCV according 

to the method of any of Claims 24 to 36, 

(b) if the dog has not been exposed to CRCV, administering the test 
vaccine to the dog, 

(c) inoculating the dog with CRCV, and 

15 (d) determining whether the dog develops CIRD, 

wherein the absence of CIRD in step (d) indicates that the test 
vaccine is capable of preventing CIRD. 

70. A vaccine identified by the method of Claim 69. 

20 

71. The E. coli strain Spike D-l CRCV, containing a plasmid whose 
insert contains a portion of the CRCV S cDNA, as deposited by the Royal 
Veterinary College at the NCIMB under Accession number NCIMB 41146 
on 25 July 2002. 

25 

72. The plasmid contained in E. coli strain Spike D-l CRCV, deposited 
by the Royal Veterinary College at the NCIMB under Accession number 
NCIMB 41 146 on 25 July 2002. 
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73. A kit of parts for the immunosorbent assay according to any of 
Claims 37-43, comprising a solid phase, a CRCV or CRCV-like S protein 
and/or a CRCV or CRCV-like HE protein for coating the solid phase, and a 
detectable label conjugate. 

5 

74. A kit of parts according to Claim 73 wherein the solid phase 
comprises a microtitre plate. 

75. A kit of parts according to Claim 73 or 74 wherein the detectable 
10 label conjugate comprises an anti-dog antibody. 

76. A kit of parts according to Claim 73 or 74 wherein the detectable 
label conjugate comprises an enzyme. 

15 77. A kit of parts according to Claim 76 further comprising a substrate 
for the enzyme. 

78. A kit of parts according to any of Claims 73 to 77 further comprising 
a positive control sample that contains an anti-CRCV S protein antibody 

20 and/or an anti-CRCV HE protein antibody. 

79. A method of passively immunising a dog against CRCV, comprising 
administering an antibody that reacts with CRCV to the dog. 

80. A method according to Claim 79 wherein the antibody that reacts 
25 with CRCV comprises an anti-BCV antibody, an anti-HCV antibody, or an 

anti-CRCV antibody. 
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81. A method according to Claim 79 or 80 wherein the antibody that 
reacts with CRCV comprises an anti-S protein antibody or an anti-HE 
protein antibody. 

82. A method according to any of Claims 79 to 81 wherein the antibody 
that reacts with CRCV comprises the anti-CRCV antibody according to 
Claim 20 or 23. 

83. Use of an antibody that reacts with CRCV in the preparation of a 
medicament for passively immunising a dog against CRCV. 

84. Use according to Claim 83 wherein the antibody that reacts with 
CRCV is an anti-BCV antibody, an anti-HCV antibody, or an anti-CRCV 
antibody. 

85. Use according to Claim 83 or 84 wherein the antibody that reacts 
with CRCV is an anti-S protein antibody or an anti-HE protein antibody. 

86. Use according to any of Claims 83 to 85 wherein the antibody that 
reacts with CRCV comprises the anti-CRCV antibody according to Claim 
20 or 23. 

87. A method of diagnosing CIRD the method comprising determining 
whether a dog has been exposed to CRCV according to the method defined 
in any one of Claims 24 to 36. 
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FIGURE 1 |C 

ctcagatgaa tttgaaatat gctattagtg ctaagaatag agcccgcact gttgctggtg 60 

tttccatact tagtactatg actggcagaa tgtttcatca aaaatgtttg aaaagtatag 120 

cagctacacg tggtgttcct gttgttatag gcaccactaa attttatggc ggctgggatg 180 

atatgttacg tcgccttatt aaagatgttg acaatcctgt acttatgggt tgggattatc 24 0 

ctaagtgtga 25 C 



FIGURE 2 



QMNLKYAISA KNRARTVAGV SILSTMTGRM FHQKCLKS I A ATRGVPWIG TTKFYGGWDD 
MLRRLIKDVE NPVLMGWDYP KCE 
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atgtttttga tacttttaat ttccttacca atggcttttg ctgttatagg agatttaaag 60 

tgtactacgg tttccatcaa tgatgttgac accggtgctc cttctattag cactgatgtt 120 

gtcgatgtta ctaatggttt aggtacttat tatgttttag atcgtgtgta tttaaatact 180 

acattgttgc ttaatggtta ttatcctact tcaggttcta catatcgtaa tatggcactg 240 

aagggaactt tactattgag cacactatgg tttaaaccac catttctttc tgattttatt 300 

gatggtgttt ttgctaaggt aaaaaatacc aaggttatta aagatggtgt agtgtatagt 360 

gagtttcctg ctataactat aggtagtact tttgtaaata catcctatag tgtggtagta 420 

caaccacata ctactaattt agataataaa ttacaaggtc tcttagagat ctctgtttgc 480 

cagtatacta tgtgcgatta cccacatacg atgtgtcatc ctaatctggg taataaacgc 54 0 

atagaactat ggcattggga tacaggtgtt gttccctgtt tatataagcg taatttcaca 600 
tatgatgtga atgctgatta tttgtattcc catttttatc aagaaggtgg tactttttat 
gcatatttta cagacactgg tgttgttact aagtttctgt ttcatgttta tttaggcacg 
gtgctttcac attattatgt catgcccttg acttgtaata gtgctatgac tttagaatac 
tgggttacac ctctcacttt taaacaatat ttactcgctt tcaatcaaga tggtgttatt 
tttaatgctg ttgattgtaa gagtgatttt atgagtgaga ttaagtgtaa aacactatct 
atagcaccat ctactggtgt ttatgaatta aacggttaca ctgttcagcc aattgcagat 

gtttaccgac gtatacctaa tcttcccgat tgtaatatag aggcttggct taatgataag 1020 
tcggtgcctt ctccattaaa ttgggaacgt aagacctttt caaattgtaa ttttaatatg 
agcagcctga tgtcttttat ccaggctgac tcgtttactt gtaataatat tgatgctgct 

aagatatacg gtatgtgttt tttcagcata actatagata agtttgctat acccaatggt 1200 

aggaaggttg acctacaaat gggcaatttg ggctatttgc agtcttttaa ctatagaatt 1260 

gatactactg ctacaagttg tcagttgtat tataatttac ctgctagtaa tgtttctatt 1320 

agcaggttta atccttctat ttggaatagg agatttggtt ttacagaaca atctgttttt 1380 

aagcctcaac ctgtaggtgt ttttactgat catgatgttg tttatgcaca acattgtttt 1440 

aaagctccca caaatttctg tccgtgtaaa ttgaatgggt ctttgtgtgt aggtagtggt 1500 

tttggtatag atgctggtta taaaaatagt ggtataggca cttgtcctgc aggtactaat 1560 
tatttaactt gttataatgc taaccaatgt gattgtttgt gcactccaga ccctatttta 
tctaaatcta cagggcctta taagtgcccc caaactaaat acttagttgg cataggtgag 

cactgttctg gtcttgctat taaaagtgat tattgtggag gcaatccttg tacttgccaa 1740 

ccaaaagcat ttttgggttg gtctgtggac tcttgtttac aaggggatag gtgtaatatt 1800 

tttgctaatt ttattttgca tggtgttaat agtggtacta cttgttctac tgatttacaa I860 

aaatcaaaca cagacataat tcttggtgtt tgtgttaatt atgatcttta tggtattaca 1920 

ggccaaggta tttttgttga ggttaatgcg acttattata atagttggca gaacctttta 1980 

tatgattcta atggtaatct ctatggtttt agggactact taacaaacag aacttttatg 2040 

attcgtagtt gctatagcgg tcgtgtttca gcgggctttc actctaactc ttccgaacca 2100 

gcattgctat ttcggaatat taaatgcaat tacgttttta ataatactct ttcacgacag 2160 

ctgcaaccta ttaactattt tgatagttat cttggttgtg ttgtcaatgc tgataatagt 2220 
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acttctagtt ctgttcaaac atgtgatctc acagtaggta gtggttactg gggggattac 2280 

tctacacaaa gacgaagtcg tagaacgatt accactggtt atcggtttac taattttgag 2340 

ccatttactg ttaatccagt aaatgatagt ttacaccctg taggtggttt gtatgaaatt 2400 

caaatacctt cagagtttac tataggtaat atggaggagt ttattcaaac aagatctcct 24 60 

aaagttacta ttgattgtcc tgtttttgtc tgtggtgatt atgcagcatg taaatcacag 2520 

ttggttgaat -atggtagttt ttgtgacaat attaatgcta tactcacaga agtaaatgaa 2580 

ctacttgaca ctacacagtt gcaagtagct aatagtttaa tgaatggtgt cactcttagc 2640 

actaagctta aagatggctt taatttcaat gtagatgaca tcaatttttc ccctgtatta 2700 

ggttgtttag gaagcgaatg taataaagtt tccagtagat ctgctataga ggatttactt 27 60 

ttttctaaag taaagttatc tgatgttggt tttgttgatg cttataataa ttgtactgga 2820 

ggtgccgaaa ttagggacct catttgtgtg caaagttata atggtatcaa agtgttgcct 2880 

ccactgctct cagaaaatca gatcagtgga tacactttgg ctgccacctt tgctagtctg 2940 

tttcctcctt ggtcagcagc agcaggcgta ccattttatt taaatgttca gtatcgtatt 3000 

aatggtattg gtgttaccat ggatgtgcta actcaaaatc aaaagcttat ttctaatgca 3060 

tttaacaatg cccttgatgc tattcaggaa gggtttgatg ctaccaattc tgctttagtt 3120 

aaaattcaag ctgttgttaa tgcaaatgct gaagctctta ataacttatt gcaacaactc 3180 

tctaataaat ttggtgctat aagtgcttct ttacaagaaa trcuarctag acttgatgct 3240 

cttgaagcgc aagctcagat agacagactt atcaatgggc gtcttaccgc tcttaatgct 3300 

tatgtttctc aacagcttag tgattctaca ctagtaaaat ttagtgcagc acaagctatg 3360 

gagaaggtta atgaatgtgt caaaagccaa tcatctagga taaatttttg tggtaatggt 3420 

aatcatatta tatcattagt gcagaatgct ccatatggtt tgtattttat ccactttagc 3480 

tatgtcccta ctaagtatgt cactgcgaag gttagtcccg gtctgtgcat ygcaggtgat 3540 

agaggtatag ctcctaagag tggttatttt gttaatgtaa ataacacttg gatgttcact 3600 

ggtagtggtt attactaccc tgaacctata actggaaata atgtggttgt tatgagtacc 3660 

tgtgctgtta actatactaa agcaccggat gtaatgctga acatttcaac acccaacctc 3720 

cctgatttta aggaagagtt ggatcaatgg tttaaaaacc aaacattaat ggcaccagat 3780 

ttgtcacttg attatataaa tgttacattc ttggacctac aagatgaaat gaataggtta 3840 

caggaggcaa taaaagtttt aaatcatagc tacatcaatc tcaaggacat tggtacatat 3900 

gaatattatg taaaatggcc ttggtatgta tggcttttaa ttggccttgc tggcgtagct 3960 

atgcttgttt tactattctt catatgctgt tgtacaggat gtgggactag ttgttttaag 4020 

aaatgcggtg gttgttgtga tgattatact ggacatcagg agttagtaat caaaacgtca 4080 

catgacgact aa 4092 
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FIGURE 6 

T101 CTCAGATGAATTTGAAATATGCTATTAGTGCTAAGAATAGAGCCCGCACTGTTGCTGGTG 
BCV CTCAAATGAATTTGAAATATGCTATTAGTGCTAAGAATAGAGCCCGCACTGTTGCTGGTG 
OC4 3 CTCAAATGAATTTGAAATATGCTATTAGTGCTAAGAATAGAGCCCGCACTGTTGCTGGTG 
HEV CTCAAATGAATTTGAAATATGCTATTAGTGCCAAGAATAGAGCCCGCACTGTTGCTGGTG 
CCV CTCAGATGAATTTGAAATATGCTATTTCTGGAAAGGCTAGAGCTCGTACAGTAGGAGGAG 
**** ********************* *** ****** ** ** ** * ** * 

T101 T T T C CAT AC T T A GT AC TAT G AC T G G C A G AAT G T T T CAT C AAAAAT GT T T G AAAAG TAT AG 
BCV TTTCCATACTCAGTACTATGACTGGCAGAATGTTTCATCAAAAATGTTTGAAAAGTATAG 
OC43 T T T C CAT AC T T AG T AC TAT G ACT GGCAGAAT GT T T CAT C AAAAAT GT T T GAAAAGT AT AG 
HEV T T T C C AT A C T TAG T AC TAT G AC T G G C AG AAT G T T T CAT C AAAAAT G C T T G AAAAG TAT AG 
CCV T T T C AC T T C T T T C T AC C AT G AC T AC GAG AC AAT AC C ACCAGAAG CAT T T G AAGT C AAT T G 
**** * ** *** ****** *** * ** ** ** ***** ** * 

T101 CAGCTACACGTGGTGTTCCTGTTGTTATAGGCACCACTAAATTTTATGGCGGCTGGGATG 
BCV CAGCTACACGTGGTGTTCCTGTTGTTATAGGCACCACTAAGTTTTATGGCGGCTGGGATG 
OC4 3 CAGCTACACGTGGTGTTCCTGTAGTTATAGGCACCACTAAATTTTATGGTGGCTGGGATG 
HEV CAGCTACACGTGGCGTTCCTGTGGTTATAGGCACCACTAAATTTTATGGCGGCTGGGATG 
CCV CTGCAACACGCAATGCCACTGTGGTTATTGGCTCAACCAAGTTTTATGGTGGTTGGGATA 

* +± ***** * **** ***** *** * ** ** ******** ** ****** 

T101 ATATGTTACGTCGCCTTATTAAAGATGTTGACAATCCTGTACTTATGGGTTGGGATTATC 
BCV ATATGTTACGTCGCCTTATTAAAGATGTTGATAATCCTGTACTTATGGGTTGGGATTATC 
OC4 3 ATATGTTACGCCGCCTTATTAAAGATGTTGACAATCCTGTACTTATGGGTTGGGATTATC 
HEV ATATGTTACGCCGCCTTATTAAAGATGTTGATAATCCTGTACTTATGGGTTGGGATTATC 
CCV ACATGCTTAAAAATTTAATGCGTGATGTTGATAATGGTTGTTTGATGGGATGGGACTATC 

* *** * * ** ******** *** * * ***** ***** **** 

T101 CTAAGTGTGA 

BCV CTAAGTGTGA 

OC4 3 CTAAGTGTGA 

HEV CAAAGTGTGA 

CCV CTAAGTGTGA 
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FIGURE 7 



pr ot HCVpoly MNLKYAI SAKN RART VAGVS I L STMT GRMFHQKCLKS IAATR 

protHEVpoly MNLKYAI SAKN RART VAGV S I L S TMT GRM FHQKCLKS IAATR 

protBCVpoly MNLKYAI SAKNRARTVAGVSILSTMTGRMFHQKCLKS IAATR 

protCRCVpol — QMNLKYAI SAKNRARTVAGVSILSTMTGRMFHQKCLKS IAATR 

protCECVpol MTQMNLKYAISGKARARTVGGVSLLSTMTTRQYHQKHLKS IAATR 

******** * ***** ********* * -*** ******** 

• • • * 

protHCVpoly GVPVVIGTTKFYGGWDDMLRRLIKDVDNPVLMGWDYPKC 
protHEVpoly GVPWIGTTKFYGGWDDMLRRLIKDVDNPVLMGWDYPKC 
protBCVpoly G V P V V IGTTKFYGGWD DMLRRL IKDVDNPVLMGWDYPKC 

protCRCVpol GVPWIGTTKFYGGWDDMLRRLIKDVENPVLMGWDYPKC — 

protCECVpol NATVVIGSTKFYGGWDNMLKNLMRDVDNGCLMGWDYPKC 

****.********.**• *••**•* ********* 
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CRCVspike 
CECVspike 



ATGTTTTTGATACTTTTA ATTTCCTTACCAATG 

ATGATTGTGCTCGTAACTTGCATTTTATTGTTATGTTCATACCACACTGCTTCGAGTACG 



k kr k k -k -k 



CRCVspike 
CECVspike 



GCTTTTGCTG-TTATAGGAGATTTAAAGTGTACTACGGTTTC-CATCAATGATGTTGACA 
T C AAAT AAT GAT T G T AG AC AAG T T AA — C G T AAC AC AAT TAG AT G G C AAT GAAAAC C T C A 



•k -k kk kk -kk-k k k -k -k -k kr kr k kr -k k kr 



•k k kr k -k k 



■k* 



CRCVspike CCGGTG-CTCCTTCTATTAGCACTGATGTTGTCGATGTTACTAATGGTTTAGGTACTTAT 
CECVspike TTAGAGACTTTTTGTTTCAAAACTT-TAAAGAAGAAGGAACTGTAGTTGTTGGTGGTTAC 

k kr ** kk kr kr k kkk k k *kkr k -k-kk kr k k k * kr k * * 



CRCVspike T AT GT TT TAGA TCGTGTG — TAT TT AAAT ACT AC A TTGTTGCTTAATGGTTA 

CECVspike T AC C C T AC AG AGG T T T GG TAT AAC T G T T C T AGAAC AG CAAC AAC T AC T G C C T A- T GAG T A 

■kk kr kkk kr kk k kr kk kr -k kr k- k k kkk kk -k k kr kr 



CRCVspike TTATCCTACTTCAGGTTCTACATATCGTAATATGGCA-CTGAAGGGAACTTTACTATTGA 
CECVspike T T T C AG T AAT AT AC AC G CAT T C TAT T T T GAT AT G G AAG C CAT G GAG AAT AG T AC T G G T AA 

■k*k kk k k k kkk: k kkk^ckk k k kr -kkk kk-kk ~k kr 



CRCVspike 
CECVspike 



-GCACACTATGG-TTTAAACCACCATTTCTTTCTGATTTTATTGATGGTGTTTTTGCTAA 
TGCACGTGGTAAACCTTTATTATTTCATGTTCATGGTGAGCCTGTTAGTGTCATCATATA 

•k-k-kkr k kkk k kk -kk k -kk * kkkkr kr -k 



CRCVspike 
CECVspike 



G G T AAAAAAT AC C AAGG T TAT T AAAG AT G G T G TAG T G TAT AG TGAGTTTCCTGCTAT 

CATATCTTATAGAGATGATGTGCAACATAGGCCACTTTTAAAACACGGATTAGTGTGCAT 



k kr k -k -k -k -k k k -k ~k kk k -k *k k -k 



k kk 



k-k 



CRCVspike AACTATAGGTAGTACTTTTG — TA-AATACATCCTATAGTGTGGTAGTACAACCACATAC 

CECVspike AACTG/^AAGTCGCAACATTGACTATAACAGTTTCACCAGTA-GCCAGTGGAATTCCATAT 

•k-k-kk k kk k k kkk k kr -k -k k k k kr k -kkk -k k kr -k k k 



CRCVspike 
CECVspike 



- T AC T AAT T TAG AT AAT AAAT T AC AAG G T C T C T TAG AG AT CTCTGTTTGC C AG TAT AC T A 
GTACGGGTAATGACAGAAAAATTCCTT-TCTCTGTCATACCCACGGACAATGGAACAAAA 

k-kk k kkr kr kk-k k * kr k kr k k kk-k k k k k * 
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-TGTGCGATTACCCACATA-CGATGTGTC-ATCCTAATCTGGGT-AATAAACG — CATAG 
ATTTATGGTCTTGAGTGGAATGATGAATTTGTTACAGCGTACATTAGTGGTCGTTCTTAT 
k k * * * **** * * * * * * * * 

AACTATGGCATTGGGATACAGGTGTTGTTCCCTGTT-TATATAAGCGTAATTTCACATAT 
AATTGGAACATCAATAATAATTGGTTTAACAATGTCACGCTTCTGTATAGTCGCTCAAGC 
* *** * * *** * * * ** * * -kk 

GATGTGA-ATGCTGATTATTTGTATTCCCATTTTTATCAAGAAGGTGGTACTTT TTA 

ACTGCCACATGGCAACACAGTGC-TGCATACGTTTACCAAGGTGTTTCTAACTTCACTTA 

* kkk k kk k k k kkkk kkkk k k kk kk kkk 

TGCATATTTTACAGACACTGGTGTTGTTACTAAGTTTCTGTTTCATGTTTAT-TTAGGCA 
TTACAAGTTAAATAACACCAATGGTCTAA — AAAC C TAT G AAT TAT G T G AAG AT TAT G AA 

* k kk k kkkk kk kkk kk k kk k kkkk k kkk k k 

CGGTGCTTT CACATTATTA-TGTCATGCCCTTGACTTGTAATAGTGCTATGACTTTA 

TATTGCACTGGCTACGCCACTAACATCTTTGCCCCAACTGTGGGAGGTTACATACCTGAT 

kkk k kk k kk kk k kk kkk kk kk k -k 

GAATACTGGGTTA CACCTCTCACTTTTAAACAATATTTACTCGCTTTCAATCAAG 

GGATTTAGTTTTAACAATTGGTTTTTGCTTACAAACAGCTCCACTTTTGTTAGTGGCAGA 

* kk kr kkk k k kkk kkkkk k kk kk 

ATGGTGTTATTTTTAATGCTGTTGATTGTAAGAGTGATTTTATGAGTGAGATTAAGTGT- 
TTTGTAACAAATCAACCATTATTAGTTAATTGCTTGTGGCCAGTTCCTAGTTTTGGTGTT 

k kk kkk k kk kk k kk k kk kk kkkk 

— AAAAC AC TAT C TAT AG C AC CAT C T AC T G G T G T T TAT G AAT T AAAC G G T TAG AC T G T T C 
GCAGCACAAGAATTTTGTTTTGAAGGTGCACAGTTTAGCCAATGTAATGGTGTGTTTTTA 

k kkk kkk k kkkkk kkkkk k -k k 

AG C C A- AT T G C AG AT G T T TAG C G AC G TAT AC C T AAT C T T C CC G — ATTGTAATATAGAGG 
AATAACACAGTAGATGTCATTAGATTCAACCTTAATTTTACTGCAGATGTACAATCTGGC 
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CTTGGCTTAATGATAAGT-CGGTGCCTTCTCCATTAAATTGGGAACGTAAGACCTTTTCA 
ATGGGTGCTACAGTATTTTCACTGAATACAACAGGTGGTTGCATTCTTGAGATTTCTT — 



CRCVspike 
CECVspike 



AATTGTAATTTTAATATGAGCAGCCTGATGTCTTTTATCCAGGCTGACTCGTTTACTTGT 
- G T T AT AAT GAT AT AGT G AG CGAG T C AAGT T T C T AC AG T TAT GG T G A AATTCCCTTC 



** **** ** ***** 



* * * * * * -kkk 



** * k 



CRCVspike 
CECVspike 



AAT AAT AT T GAT GC T G C T AAG AT AT AC G G TAT GTGTTTTTT C A — GCATAACTATAGATA 
GGCGTAACTGATGG-ACCGCGTTAT-TGTTATGTCCTCTATi^ATGGCACAGCTCTTAAGT 
* ***** * k *** * ***** * k k k *** k k-k k k 



CRCVspike 
CECVspike 



AGTTTGCTA TACCCAATGGTAGGAAGGTTGACCTACAAATGGGCAATTTGGGCTATT 

AT T T C G G C AC AT T AC C C C CT AG T G T C AAG G — AAATTGCTATTAG-TAAGTGGGGCCAAT 
* -k kr k k ***** * ** **** * * * * * * ** * **** * * 



CRCVspike 
CECVspike 



TGCAGTCTTTTAACTATAGAATTGATACTACTGCTACAAGTTGTCAGTTGTATTATAATT 
TTTATATTAATGGTTACAATTTCTTTAGCACTTTTCCTATTGATTGTATATCTTTTAACT 
k * * * ** * * ** *** * * * * * * * ** *** * 



CRCVspike 
CECVspike 



TACCTGCTAGTAATGTTTCTATTAGCAGGTTTAATCCTTCTATTTGGAATA — GGAGATT 
T AAC C AC T GG T GAT AG T G GAG CAT T T T G G AC AAT T G C T T AC AC AT C G TAG AC T GAAG CAT 



** * ** ** ** * 



** * * *** * * * * * * ** * 



CRCVspike 
CECVspike 



TGGTTTTA-CAGAACAATCTGTTTTTAAGCCT-CAACCTGTAGGTGTTTTTACTGATCAT 
TAG T AC AAG T T G AAAAC AC AG C CAT T AAAAAG G T G AC G TAT T G T AAC AG T C A C - AT T AA T 



* ** * *** * * * **k* 



* * * * * 



* ** * ** 



CRCVspike 
CECVspike 



GATGTTGTTTATGCACAACATTGTTTTAAAGCTCCCACAAATTTCTGTCCG TGTA 

AACATCAAATGTTCTCAACTTACTGCTAATTTGCAAAATGGCTTTTATCCTGTTGCTTCA 
* * *kk **** * * kkk k * ** * *** * * 



CRCVspike 
CECVspike 



AATTGAATGGGTCTTTGTGTGTAGGTAGTGGTTTTGGTA — TAGATGCTGGTTATAAA — 
AGTGAAGTTGGTCTTGTCAATAAGAGTGTTGTGTTACTACCTAGTTTCTATTCACATACC 



* * * * ****** 



** ** ** ** ** *** * ** * * * * 



WO 2004/011651 



PCT/GB2003/002832 



11/40 

FIGURE 8 (Page 4 of 9) 



CRCVspike 
CECVspike 



AATAGTGGTATAGGCACTTGTCCTGCAGGTACTAATTATTTAACTTGTTATAATGCTAAC 
AG T G T T AAT AT AAC TAT T GAT C T T G GTATGAAGCGTAGTGGTTATGGTCAACCCA — 



CRCVspike CAATGTGATTGTTTGTGCACTCCAGAC — CCTATTTTATCTAAATCTACAGGGCCTTA-T 

CECVspike T AGC C T C AAC AC T AAGT AAC A T C AC AC T AC C AAT G C AG GAT AAT AAC AC C GAT G T G TAG T 

* * kr -k -k-k ** -k-k-k -k-k * ** -k 



CRCVspike 
CECVspike 



AAGTGCCCCCAAACTAAATACTTAGTTGGCATAGGTGAGCACTGTTCTGGTCTTGCTATT 
GTATTCGTTCTAACCAATT-CTCAGTTTATGTTCACTCCACTTGCAAAAGTTCTTTATGG 

■k-k -k -k-k-k -k-k -k -kk -k-k-k-k -k -k-k * * * 



CRCVspike AAAAGTGATTATTGTGGAGGCAATCCTTGTACTTGCCAACCAAAAGCATTTTTGGG— TT 

CECVspike G AC AAC AAT T T T AAT C AAG AT T G C AC AG AT G T T T TAT AT G C C AC AG C T G T T AT AAAAAC T 

•k k -k-k-k kr -k -kkr kr kr -k-k -k -k k -k-k-k -k-k -k -k 



CRCVspike GGTCTGTGGAC — TCTTGTTTACAAGG — GGATAGGTGTAATATTTTTGCTAA-TTTTAT 

CECVspike GGTACTTGCCCCTTCTCATTTGATAAATTGAATAATTACTTAACTTTTAACAAGCTTTGT 

-k-k-k -k-k -k -k-k-k -k-k-k -k * k-kk kr -k **-k* k-k-k -k 



CRCVspike TTTGCATGGTGT — TAATAGTG GTACTACTTGTTCTACTGATT-TACAAAAATC 

CECVspike TTGTCGTTGAATCCTACTGGTGCCAACTGTAAGTTTGATGTTGCTGCCCGTACAAGAACC 
■k-k k-k-k -k *-k -k -k-kk -k-k-k -k-k-k -k-k-k -k-k-kk-k kr 



CRCVspike 
CECVspike 



AAACACAGACATAATTCTTGGTGTTTGTGTTAATTATGATCTTTATGGTATTACAGGCCA 
AA-TGAGCAGGTTGTTAGAAGTTTATATGTAATATATGAAGAAGGAGACAACATAGTGGG 



-k-k -k k -k-k-k -k 



-k-k-k-k* 



CRCVspike AGGTATTTTTGTTGA G G T T AAT G C G AC T TAT TAT AAT AG T T G G C AG AAC C T T T TAT 

CECVspike TGTACCGTCTGATAATAGTGGTCTTCACGATTTGTCAGTGTTACACTTAGACTCCTGTAC 
* * * * -k-k-k -k-k-k-k -k -k-k-k k -k -k-k 



CRCVspike 
CECVspike 



AT GAT T C T AAT G GTAATCTCTATGGTTTTAGGGACTACTTAACAAACAGA-ACTTTT 

A- GAT T AC AAT AT AT AT G G T AG AAC TG G T G T T - G G TAT TAT T AG AC AAAC T AAC AG C AC A 

■k -kkk-k -k-k* * kr -k-k-k-k kkr -kkr -k *kr -k -k-k* -k-kkr -k-k 
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CRCVspike ATGATTCGTAGTTGCTATAGCG-GTCGTGTTTCAGCGGGCTTTCA CTCTAACTCTTC 

CECVspike ATACTTAGTGGCTTACATTATACATCACTATCAGGTGATTTATTAGGTTTTAAAAATGTT 

***** * * ±* ** ****** * ** * 

CRCVspike CGAACCAGCATTG-CTATTTCGGT^ATATTAAATGCAATTACGTTTTTAATAATACTCTTT 

CECVspike agtgatggtgttgtctattctgtgacaccatgtgatgtaagcgcacaagcggctgttatt 

* * *** *****' ******** * * 

CRCVspike CACG- ACAGCTGCAACCTATTAACTATTTTGATAGTTATCTTGGTTGTGTTGTCAA 

CECVspike GATGGGGCCATAGTTGGAGC-TATGACTTCCATTAATAGTGAACT-GTTAGGTCTAACAC 

* * ******* *** * * *± ***** ****** * 

CRCVspike TGCTGATAATAGTAC TTCTAGTTCTGTTCAAACATGTGATCTCACAGTAGGTAGT 

CECVspike AT T GGACAAC AACACC AAAT T T TT AT T AC T AC T CT A- T AT AT AAT AC AAC AAAT GAG 

** ** * ** ** * ** * ** * ** *** * * 

CRCVspike G GT T AC T G G GGG GAT T AC T C T AC AC AAAG AC GAAG T CGTAGAACGATTACCACTGG 

CECVspike AGA-ACTCGTGGCACTGCAATCGACAGTAACGATGTAGATTGTGAACCTATCATAACCTA 

* *** ****** * *** **** *± ******* ** 

CRCVspike TT ATCGGTTT ACT AAT T T T G AG CC AT T T AC T G T T AAT C C AG T AAAT GAT AG 

CECVspike TTCTAACATAGGTGTTTGTAAAAATGGTGCGTTGGTTTTTATTAACGTCACACATTCTGA 

** *** * * *** ** * ** * **** * ** * 

CRCVspike TTTACACCCTGTAGGTGGTTTGTAT — GAAAT-TCA-AATACCTTCAGAGTTTACTATAG 

CECVspike T GGAG AT G T T - C AAC C AAT T AGC AC T G GC AAT G T CAC GAT AC C C AC AAAC T T T AC CAT AT 

* * * * * ** * * * *** ***** ** * ***** *** 

CRCVspike GTAATATGGAGGAGTTTATTCAAACAAGATCTCCTAAAGTTACTATTGATTGTCCTGTTT 
CECVspike C T G T G C AA G T T G AAT AC AT C C AG G T T TAG AC T AC AC C G G T G T C AAT AG AT T G T T C TAG AT 

* ******** ** * ** * ** ****** ** * 

CRCVspike TTGTCTGTGGTGATTATGCAGCATGTAAATCACAGTTGGTTGAATATGGTAGTTTTTGTG 
CECVspike ACGTTTGTAATGGTAACCCTAGATGTAATAAATTGTTAACACAATATGTTTCTGCATGTC 

** *** ** * * * ****** * *** ****** * * 
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CRCVspike ACAATATTAATGCTATACTCACAG-AAGT — AAAT G AAC T ACT T GAG AC T A 

CECVspike AAACTATTGAGCAAGCGCTTGCAATGAGTGCCAGCCTTGAAAACATGGAAGTTGATTCCA 



k k k k k k k 



k-k -kkr kkk 



k -k*kk k k 



CRCVspike 
CECVspike 



CRCVspike 
CECVspike 



C AC AGT T G C AAG TAG C T AAT AG T T T AAT G AAT GG T G T CAC T CT TAG C AC T AAG C T T AAAG 
TGTTGTTTGTTTCAGAAAATGCCCTTA-AATTGGCATCTGTTGAGGCGTTCAATAGTACA 

kkk kk kkk k k k k-kkr kk k kk k k k 

ATGGCTTTAATTTCAATGTAGATGACAT CAATTT TTCCCCTGTATTAGGTTGT 

G AAC AT T TAG AT C C TAT T T AC AAAG AAT G G C C T AAC AT AG GTGGTTCTTGGC TAG G AG GT 

•k-kkk k k kk kk k kk kk k k kk kkkk kk 



CRCVspike 
CECVspike 



TTAGGAAGCGAAT GTAATAA-AGTTTCCAGTA — GATCTGCTATAGAGGAT 

C T AAAAG AC AT AC TTCCGTCC CAT AAT AG C AAAC G T AAG TAT CGTTCTGC TAT AG AAG AC 



k-k kkk 



kkkkk k 



kkkk k kkkkkkkk*kk kk 



CRCVspike 
CECVspike 



TTACTTTTTTCTAAAGTAAAGTTATCTGATGTTGGTTTTGTTGATGC T TAT AAT AAT 

TTGCTTTTTGATAAAGTTGTAACTTCTGGTCTAGGTACAGTTGATGAAGATTATAAACGT 



k-k k*k*kk kkkkkk 



kkkk k k kkk k k k k kk k k*kkkk k 



CRCVspike 
CECVspike 



TGTACTGGAGGTGCCGAAATTAGGGACCTCATTTGTGTGCAAAGTTATAATGGTATCAAA 

T G T AC AG G T G G T TAT G AC AT AG C T G AC T TAG T T T G T G CAC AAT AT T AC AAT G G CAT CAT G 
k-k-kkk kk kkk kk kk kkk k kkkkkk kkk kkk kkkkk kkkk 



CRCVspike 
CECVspike 



GTGTTGCCTC-CACTGCTCTCAGAAAATCAGATCAGTGGATACACTTTGGCTGCCACCTT 
GTTCTACCTGGTGTTGCTAAT-GATGACAAGATGACTATGTACACAGCCTCTCTTGCAGG 

kk k kkk kkkk kk k kkkk k k kkkkk kk k 



CRCVspike 
CECVspike 



TGCTAGTCTGTTTCCTCC-TTGGTCAGCAGCA — GCAGGCGTACCATTTTATTTAAATGT 

TGGTATAGCATTAGGTGCACTAGGTGGTGGCGCCGTGGCTATACCTTTTGCAGTAGCAGT 
** kk kk kkkk k kk k k -kkkk kkk kk kk 



CRCVspike 
CECVspike 



T C AG TAT C G TAT T AAT G G TAT T G G T G T TAG CAT G GAT G T GC T AAC T C AAAA T C AAAAG C T 
T C AG G C TAG AC T T AAT TAT G T T G C T C TAG AAAC T GAT G TAT T G AAC AAAAAC C AG C AG AT 

**** * * kkkkk k kkk k k k kkkkk k k kkkk k-k kk k 
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TATTTCTAATGCATTTAACAATGCCCTTGATGCTATT CAGGAAGGGTT — 

CCTGGCTAATGCTTTCAACCAAGCTATTGGTAACATTACACAGGCATTTGGTAAGGTTAA 
* ******* ** *** * ** *** * *** ** * **** 

TGATGCTA CCAATTCTGCT T TAG T T AAAAT 

T GAT GC TAT AC AT C AAAC AT C AC AAG G T C T T G C C AC T G T T GC T AAAG CAT T GG C AAAAG T 
******** *** * **** ** * *** * 

TCAAGCTGTTGTTAATGCAAATGCTGAAGCTCTTAATAACTTATTGCAACAACTCTCTAA 

G C AAG AT GT T G T T AAC AC AC AAGG G C AAGC T T T AAG C C AC CT AAC AG T ACAAC T GCAAAA 
**** ********* ** * * ***** * * ** ** ****** ** 

TAAATTTGGTGCTATAAGTGCTTCTTTACAAGAAATTCTATCTAGACTTGATGCTCTTGA 

T AGC T T C C AAG CC AT TAG T AGT T CT AT TAG T GAC AT T TAT AAT AG GC T T GAT GAAC T GAG 
** ** ** ** *** **** * ** *** *** ******* ** 

AGCGCAAGCTCAGATAGACAGACTTATCAATGGGCGTCTTACCGCTCTTAATGCTTATGT 
T GC T GAT G C AC AAGT T G AT AGG C T GAT T AC AGG T AG AC T T AC AG C AC T T AAT G CAT T T G T 
** ***** * ** ** ** ** * ** * ***** ** ******** * *** 

T T C T C AAC AG C T T AG T GAT T C T AC AC TAG T AAAAT T TAG T G C AG C AC AAG C T AT G G AG AA 
ATCTCAGACTCTAACCAGACAAGCGGAGGTTAGGGCTAGTAGACAACTTGCCAAAGACAA 
***** ** * , ,* ** * **** * ** ** * ** ** 

GGTTAATGAATGTGTCAAAAGCCAATCATCTAGGATTVAATTTTTGTGGTAATGGTAATCA 
GGTTAATGAATGTGTTAGGTCTCAGTCTCAGAGATTTGGATTTTGTGGTAATGGTACACA 
*************** * ** ** ** * **************** ** 

TATTATATCATTAGTGCAGAATGCTCCATATGGTTTGTATTTTATCCACTTTA-GCTATG 
TTTGTTTTCACTTGCAAATGCAGCACCAAATGGCATGGTTTTCTTTCACACAGTGCTAT- 
* * * *** * * * ** *** **** ** *** * *** ***** 

TCCCTACTAAGTATGTCACTGCGAAGGTTAGTCCCGGTCTGTGCATYGCAGGTGATAGAG 
TACCAACAGCTTATGAAACTGTAACAGCTTGGTCAGGTATTTGTGCTTCAGATGGCGATC 
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GTA TAGCTCCTAAGAGTGGTTATTT TGTT AATGTAAATAACA 

GCACTTTTGGACTTGTCGTTAAAGATGTTCAGTTGACGTTGTTTCGTAATCTAGATGACA 

* * * * *** ** * * ** **** *** *** 

CTTGGATGTTCACTGGTAGTGGTTATTACTACCCTGAACCTATAACTGGAAATAATGTGG 
AGTTCTATTTGACTCCCAGAACTATGTATCAGCCTAGAGCTGCAACTAGTTCTGATTTTG 

* ** *** ** * ** * *** * ** **** k * ** * k 



CRCVspike 
CECVspike 



TTGTTATGAGTACCTGTGCTGTTAACTATACTAAAGCACCGGATGTAATGCTGAACATTT 
TTCAGATTGAGGGGTGCGACGTGTTGTTTGTCAATGC7VACTGTAATTGACTTGCCTAGTA 



** kk 



** * kk k * ** *** * * * * * 



CRCVspike 
CECVspike 



CAACACCCAACCTCCCTGATTTTAAGGAAG AG T T G GAT C AAT G G T T T AAAAAC 

T T AT ACC T GAC TAT AT C GACAT T AAT C AGAC TGTT C AAG AC AT AT T AG AAAAC T AC AG AC 



k -kkk k-k 



kk * * * * k 



k k * * k 



■k-k k kk 



CRCVspike 
CECVspike 



C AAAC AT T AAT GG C ACC AG AT T T G T C AC T T GAT T AT AT AAAT G T T AC AT T C T T GG AC C T A 
C AAAC - T GG ACT G T AC C T GAAT T GAC AAT T GACAT T T T T AACG C AACC T AT T T AAAT C T G 



■kkk -k-k * -k 



k *** -k-k *** -k-k **** k k -k-k k -kk- k -k-k * -k-k 



CRCVspike 
CECVspike 



C AAG AT G AAAT GAAT AG G T T AC AG G — AG GC AAT AAAAG T T T T AAAT CAT AG C 

AC T G G T G AAAT T GAT GAC T TAG AAT T T AGG T C AG AAAAGC T AC AT AAC AC C AC AG TAG AG 
* ****** ** *** * k-kk * kkkkk k ******* 



CRCVspike 
CECVspike 



CRCVspike 
CECVspike 



T AC AT — — CAAT C T C AAG GACAT T G GT AC A 

CTTGCCATTCTCATTGACAATATTAACAATACATTAGTCAATCTTGAATGGCTCAATAGA 

***** ****** * * ** k 

TATGAATATTATGTAAAATGGCCTTGGTATGTATGGCTTTTAATTGGCCTTGCTGGCGTA 
ATTGAAACTTATGTGAAATGGCCTTGGTATGTGTGGCTACTAATAGGC-TTAGTAGTAGT 
**** ****** ***************** ***** **** *** ** * * 



CRCVspike 
CECVspike 



GCTATGCTTGTT-TTACTATTCTTCATATGCTGTTGTACAGGATG— TGGGACTAGTTG 
GTTTTGCATACCGCTATTGCTATTTTGCTGTTGTAGTACAGGTTGCTGTGGATGCATAGG 
* * *** * ****** ** *** ******* ** *** * * 
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TTTTAAGAAATGCGGTGGTTGTTGTGATGATTATACTGGACA — T CAG GAG T TAG T AAT C 
TTGTTTGGGAAGTTGTTGTCATTCTATTTGTAGTAGAAGACAATTTGAAAATTACGAACC 

* * * * **** ** * * * ** **** * * *** * 



CRCVspike AA AACGTCACATGACGACTAA— 

CECVspike AAT T G AAAAAG T G CAT G T C C AC T AAA- 
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BCVspike ATGTTTTTGATACTTTTAATTTCCTTACCAATGGCTCTTGCTGTTATAG 

HCVspike ATGTTTTTGATACTTTTAATTTCCTTACCAACGGCTTTTGCTGTTATAG 

CRCVspike ATGTTTTTGATACTTTTAATTTCCTTACCAATGGCTTTTGCTGTTATAG 

HEVspike ATGTTTTTTATACTTTTAATCACCCTGCCTTCTGTTTTTGCAGTTATAG 

******** * * * * * * * * * * * ** * * * * * * * * * * * * * * * * 

BCVspike GAGATTTAAAGTGTACTACGGTTTCCATTAATGATGTTGACACCGGTGTTCCTTCTGTTA 

HCVspike GAGATTTAAAGTGTACTACGGT.TTCCATTAATGATATTGACACCGGTGCTCCTTCTATTA 

CRCVspike GAGATTTAAAGTGTACTACGGTTTCCATCAATGATGTTGACACCGGTGCTCCTTCTATTA 

HEVspike G GG AT T T AAAGT G T AAT AC T T CAT C AAT T AAT G AC G T T G AC AC TGGTGTGC CAT C TAT T A 

* ************* *** ** ** ***** ******* **** ** *** *** 

BCVspike GCACTGATACTGTCGATGTTACTAATGGTTTAGGTACTTATTATGTTTTAGATCGTGTGT 

HCVspike GCACTGATATTGTCGATGTTACTAATGGTTTAGGTACTTATTATGTTTTAGATCGTGTGT 

CRCVspike GCACTGATGTTGTCGATGTTACTAATGGTTTAGGTACTTATTATGTTTTAGATCGTGTGT 

HEVspike GCTCTGAAGTTGTTGATGTCACTAATGGTTTGGGGACTTTCTATGTTTTAGATCGTGTCT 

** **** *** ***** *********** ** **** * ** * ** * * ** ** * * * * * * 

BCVspike ATTTAAATACTACGTTGTTGCTTAATGGTTACTACCCTACTTCAGGTTCTACATATCGTA 

HCVspike ATTTAAATACTACGTTGTTGCTTAATGGTTACTACCCTACTTCAGGTTCTACATATCGTA 

CRCVspike ATTTAAATACTACATTGTTGCTTAATGGTTATTATCCTACTTCAGGTTCTACATATCGTA 

HEVspike ATTTAAATACCACATTGTTGCTCAATGGTTATTACCCAATTTCAGGTGCTACATTTCGTA 

********** ** ******** ******** ** ** * ******* ****** ***** 

BCVspike ATATGGCACTGAAGGGAACTTTACTATTGAGCACACTATGGTTTAAACCACCTTTTCTTT 
HCVspike ATATGGCACTGAAGGGAACTTTACTATTGAGCAGACTATGGTTTAAACCACCTTTTCTTT 
CRCVspike ATATGGCACTGAAGGGAACTTTACTATTGAGCACACTATGGTTTAAACCACCATTTCTTT 
HEVspike ATGTGGCTCTGAAAGGAACTCGATTATTGAGCACCTTGTGGTTTAAGCCGCCTTTTTTAT 
** **** ***** ****** * ********* * ******** ** ** *** * * 

BCVspike CTGATTTTATTAATGGTATTTTTGCTAAGGTCAAAAATACCAAGGTTATTAAAAATGGTG 
HCVspike CTGATTTTATTAATGGTATTTTTGCTAAGGTCAAAAATACCAAGGTTATTAAAAAGGGTG 
CRCVspike CTGATTTTATTGATGGTGTTTTTGCTAAGGTAAAAAATACCAAGGTTATTAAAGATGGTG 
HEVspike CACCTTTTAATGATGGTATTTTTGCCAAGGTTAAAAACAGCAGATTTTCTAAACATGGTG 
* ***** * ***** ******* ***** ***** * ** ** **** * **** 
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BCVspike T AAT G T ATAG T GAG T T T C C T GC T ATAAC T AT AGGT AG T AC T T T T G T AAAT AC AT C C TATA 

HCVspike T AAT G TAT AG T GAG TTTCCTGC T AT AAC T AT AGG T AG T AC T T T TG T AAAT AC AT CC TATA 

CRCVspike TAG T G TAT AG T GAG TTTCCTGC TAT AAC TAT AG G TAG T AC T T T T G T AAAT AC AT C CT AT A 

HEVspike T TAT T TAT AG T GAG TTTCCTGCTAT T AC TAT AG G TAG T AC T T T T G T AAAT AC T T C C TATA 

* * * * * * * * * * * * * * * * * * * * * * ************************** ******* 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



G T GT G G TAG T AC AAC C AC AT AC T ACC AAT T TAG AT AAT AAAT T AC AAGG T C T C T TAG AG A 
GT G T G G TAG T AC AAC C AC AT AC T ACC AAT T T G G AT AAT AAATT AC AAGG T C T C T TAG AG A 
GT G T G G TAG T AC AAC C AC AT AC T AC T AAT T TAG AT AAT AAAT T AC AAG GT C T C T T AGAG A 
GCATAGTAGTAAAGCCTCATACCTCATTTATTAATGGTAATTTACAAGGTTTTTTGCAAA 

* * ****** * ** ***** * k k ** kkk * * * * * * * * * * ** k * 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



TCTCTGTTTGCCAGTATACTATGTGCGAGTACCCACATACGATTTGTCATCCTAATTTGG 
TCTCTGTTTGCCAGTATACTATGTGCGAGTACCCACATACGATTTGTCATCCTAATCTGG 
TCTCTGTTTGCCAGTATACTATGTGCGATTACCCACATACGATGTGTCATCCTAATCTGG 
TTTCTGTTTGTCAATATACTATGTGTGAATACCCACAGACTATTTGTCATCCTAATTTGG 
* ******** ** *********** kk ******** k-k ** ************ *** 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



GTAATCGGCGCATAGAACTATGGCATTGGGATACAGGTGTTGTTTCCTGTTTATATAAGC 
GTAATCGACGCGTAGAACTATGGCATTGGGATACAGGTGTTGTTTCCTGTTTATATAAGC 
GTAATAAACGCATAGAACTATGGCATTGGGATACAGGTGTTGTTCCCTGTTTATATAAGC 
G T AAT CAAC G C AT AGAAT TAT G G CAT CAT G AC AC AG AT GT TGTTTCTTGTT T AT ACAG G C 



*** ***** k k * * * k * * 



k-k **** k-k k k * * * k ******** k k-k 



BCVspike GTAATTTCACATATGATGTGAATGCTGATTATTTGTATTTCCATTTTTATCAAGAAGGTG 
HCVspike G T AAT T T CAC AT AT GAT G T GAAT GC T GAT T AC T T G TAT T TCC AT T T T TAT C AAG AAGG T G 

CRCVspike GT AAT T T CAC AT AT GAT G T G AAT GC T GAT TAT T T G TAT TCC CAT T T T TAT C AAG AAGG T G 

HEVspike G T AAT T T CAC AT AT GAT G T GAAT G C T GAT TAT T TAT AT T T T CAC T T T T A T C AG G AAG G T G 

******************************* ** **** ** ******** ******* 



BCVspike GTACTTTTTATGCATATTTTACAGACACTGGTGTTGTTACTAAGTTTCTGTTTAATGTTT 

HCVspike GTACTTTTTATGCATATTTTACAGACACTGGTGTTGTTACTAAGTTTCTGTTTAATGTTT 

CRCVspike GTACTTTTTATGCATATTTTACAGACACTGGTGTTGTTACTAAGTTTCTGTTTCATGTTT 

HEVspike GCACTTTTTATGCATACTTTACAGATACTGGTTTTGTGACCAAGTTTCTGTTTAAGTTGT 
* ************** ******** ****** **** ** ************ * * * 
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BCVspike ATTTAGGCACGGTGCTTTCACATTATTATGTCATGCCTTTGACTTGTAATAGTGCTATGA 
HCVspike ATTTAGGCACGGTGCTTTCACATTATTATGTCCTGCCTTTGACTTGTAATAGTGCTATGA 
CRCVspike ATTTAGGCACGGTGCTTTCACATTATTATGTCATGCCCTTGACTTGTAATAGTGCTATGA 
HEVspike ATTTAGGCACTGTGCTGTCACACTATTATGTTATGCCATTGACTTGTGATAGCGCTTTAT 

*^*** ***** ******** **** ********* **** *** * 

BCVspike CTTTAGAATATTGGGTTACACCTCTCACTTCTAAACAATATTTACTCGCTTTCAATCAAG 
HCVspike C T T T AG AAT AT T G G G T T AC AC C T C T C AC T T C T AAAC AAT AT T T AC TAG C T T T C AAT C AAG 

CRCVspike CTTTAGAATACTGGGTTACACCTCTCACTTTTAAACAATATTTACTCGCTTTCAATCAAG 
HEVspike CTTTAGAATATTGGGTTACACCTCTCACTACTAGACAATTTCTTCTAGCCTTTGACCAGG 
********** ****************** ** ***** * * ** ** * * 

BCVspike ATGGTGTTATTTTTAATGCTGTTGATTGTAAGAGTGATTTTATGAGTGAGATTAAGTGTA 

HCVspike ATGGTGTTATTTTTAATGCTGTTGATTGTAAGAGTGATTTTATGAGTGAGATTAAGTGTA 

CRCVspike ATGGTGTTATTTTTAATGCTGTTGATTGTAAGAGTGATTTTATGAGTGAGATTAAGTGTA 

HEVspike ATGGTGTTTTATACCATGCTGTTGATTGTGCTAGTGATTTTATGAGTGAGATTATGTGTA 

* * ************** ***************** ***** ***** 

BCVspike AAACACTATCTATAGCACCATCTACTGGTGTTTATGAATTAAACGGTTACACTGTTCAGC 
HCVspike AAAC AC TAT C T AT AGC AC CAT C T AC T GGTGT T TAT G AAT T AAACGG T T AC AC T GT T C AG C 

CRCVspike AAAC AC TAT C TAT AG C ACC AT C T AC T GG T G T T TAT G AAT T AAAC G G T T AC AC T G T T C AG C 

HEVspike AAAC T T C T T C AAT T AC AC C AC C T AC T GG T G T T TAT G AAC T AAAC G G T T AC AC AG T T C AAC 

**** ** ** ***** ***************** ************* ***** * 

BCVspike C AAT T G C AG AT G T T T AC C G AC G TAT AC C T AAT C T T C C C GAT T GT AA T AT AGAG G C T T G G C 

HCVspike CAATTGCAGATGTTTACCGACGTATACCTAATCTTCCCGATTGTAATATAGAGGCTTGGC 
CRCVspike CAATTGCAGATGTTTACCGACGTATACCTAATCTTCCCGATTGTAATATAGAGGCTTGGC 
HEVspike CTGTTGCCACTGTGTATCGTAGAATACCTGACTTACCCAATTGCGATATCGAAGCTTGGC 
* **** *** ** ** * ****** * * *** **** **** ** ******* 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



TTAATGATAAGTCTGTGCCCTCTCCATTAAATTGGGAACGTAAGACCTTTTCAAATTGTA 
TTAATGATAAGTCGGTGCCCTCTCCATTAAATTGGGAACGTAAGACCTTTTCAAATTGTA 
TTAATGATAAGTCGGTGCCTTCTCCATTAAATTGGGAACGTAAGACCTTTTCAAATTGTA 
TTAATTCTAAGACCGTTTCTTCGCCTCTTAATTGGGAACGTAAAATTTTTTCTAATTGTA 
***** **** * ** * ** ** * ************** * ***** ******* 
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BCVspike ATTTTAATATGAGCAGCCTGATGTCTTTTATTCAGGCAGACTCATTTACTTGTAATAATA 
HCVspike AT T T T AAT AT GAG C AG C C T GAT GTCTTTTATT C AG G C AG AC T CAT T TAG T T G T AAT AAT A 

CRCVspike ATTTTAATATGAGCAGCCTGATGTCTTTTATCCAGGCTGACTCGTTTACTTGTAATAATA 
HEVspike ATTTTAACATGGGCAGGCTGATGTCTTTTATTCAGGCTGACTCTTTTGGTTGTAACAATA 
******* *** **** ************** ***** ***** *** ****** **** 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



TTGATGCAGCTAAGATATATGGTATGTGTTTTTCCAGCATAACTATAGATAAGTTTGCTA 
TTGATGCTGCTAAGATATATGGTATGTGTTTTTCCAGCATAACTATAGATAAGTTTGCTA 
TTGATGCTGCTAAGATATACGGTATGTGTTTTTTCAGCATAACTATAGATAAGTTTGCTA 
TTGATGCTTCTCGCTTATATGGTATGTGTTTTGGTAGCATTACTATTGACAAGTTTGCTA 



******* 



**** ************ 



***** ** *** ** ********** 



BCVspike TACCCAATGGTAGGAAGGTTGACCTACAATTGGGCAATTTGGGCTATTTGCAGTCTTTTA 
HCVspike TACCCAATGGTAGGAAGGTTGACCTACAATTGGGCAATTTGGGCTATTTGCAGTCTTTTA 
CRCVspike TACCCAATGGTAGGAAGGTTGACCTACAAATGGGCAATTTGGGCTATTTGCAGTCTTTTA 
HEVspike TACCCAATAGTAGAAAGGTTGATCTGCAAGTGGGTAAATCTGGTTATTTACAATCTTTTA 
******** **** ******** ** *** **** ** * ** ***** ** ******* 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



AC TAT AG AAT T GAT AC T AC T G C T AC AAG T T G T C AG T T G TAT TAT AAT T T AC C T G C T GC T A 
AC TAT AG AAT T G AT AC T AC T GC T ACAAG T T G T C AG T T G TAT TAT AAT T T ACC T GC T G C T A 
AC TAT AG AAT T GAT AC T AC T G C T AC AAG T T G T C AG T T G TAT TAT AAT T T AC C T GC TAG T A 
AT TAT AAG AT T G AC AC T G C T G T T AGC AG T T G T C AA C T C TAT T AT AG T T T GC C T GC AG C AA 
* **** ***** *** *** ** ******** * ******* *** ***** * 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



ATGTTTCTGTTAGCAGGTTTAATCCTTCTACTTGGAATAGGAGATTTGGTTTTACAGAAC 
ATGTTTCTGTTAGCAGGTTTAATCCTTCTACTTGGAATAGGAGATTTGGTTTTACAGAAC 
ATGTTTCTATTAGCAGGTTTAATCCTTCTATTTGGAATAGGAGATTTGGTTTTACAG7\AC 
ACGTATCTGTCACTCATTATAATCCTTCATCTTGGAACAGAAGGTATGGGTTTAT T 



* * *** * * 



* ********* 



****** ** ** * *** **** 



BCVspike AATCTGTTTTTAAGCCTCAACCTGTAGGTGTTTTTACTGATCATGATGTTGTTTATGCAC 
HCVspike AATCTGTTTTTAAGCCTCAACCTGTAGGTGTTTTTACTCATCATGATGTTGTTTATGCAC 
CRCVspike AATCTGTTTTTAAGCCTCAACCTGTAGGTGTTTTTACTGATCATGATGTTGTTTATGCAC 

HEVspike AATCAGAGTTTTGGTTCCAG AGGC-CTT CATGATGCTGTATATTCAC 

**** * *** * ** *** ** ******* *** *** *** 



WO 2004/011651 



PCT/GB2003/002832 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 
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AACATTGTTTTAAAGCTCCCACAAATTTCTGTCCGTGTAAATTGGATGGGTCTTTGTGTG 

AACATTGTTTTAAAGCTCCCACAAATTTCTGTCCGTGTAAATTGGATGGGTCTTTGTGTG 

AACATTGTTTTAAAGCTCCCACAAATTTCTGTCCGTGTAAATTGAATGGGTCTTTGTGTG 

AGC AAT GT T T T AAT AC ACCT AAT ACAT AT TGTCCTTG T A GAACAAGXC — AATGCA 

* ******** * ** * * * ***** **** * * *** ** 

TAGGTAGTGGTTCTGGTATAGATGCTGGTTATAAAAATAGTGGTATAGGCACTTGTCCTG 

TAGGTAATGGTCCTGGTATAGATGCTGGTTATAAAAATAGTGGTATAGGCACTTGTCCTG 

TAGGTAGTGGTTTTGGTATAGATGCTGGTTATAAAAATAGTGGTATAGGCACTTGTCCTG 

TAGGTGGTG CTGGCACAGGAACTTGTCCTGTAGGCACCACTGTGCGCAAGTGTTTTG 

***** ** ****** ** ** * * * * * *** *** ** 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



CAGGTACTAATTATTTAACTTGTCATAATGCTGCCCAATGTAATTGTTTGTGCACTCCAG 
CAGGTACTAATTATTTAACTTGCCATAATGCTGCCCAATGTGATTGTTTGTGCACTCCCG 
CAGGTACTAATTATTTAACTTGTTATAATGCTAACCAATGTGATTGTTTGTGCACTCCAG 
CTG C-AGTTAC — A AACGCTACTAAGTGTACTTGCTGGTGTCAACCAG 



* * * * *** * 



** *** * *** *** * *** ** * 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



AC C C CAT T AC AT C T AAA T C T AC AG G GC C T T AT AAG T G C C C C C AAAC T AAAT AT T TAG T T G 

AC C C CAT TAG AT C T AAAT C T AC AG G G CC T T ACAAG T GC CC CCAAAC T AAAT AC T TAG T T G 

ACCCTATTTTATCTAAATCTACAGGGCCTTATAAGTGCCCCCAAACTAAATACTTAGTTG 

ATCCTTCCACATATAAAGGTGTAAATGCCTGGACTTGTCCGCAATCTAAAGTTTCTATAC 
* ** ** **** * * * * * ** ** *** ***** * * 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



GCATAGGTGAGCACTGTTCGGGTCTTGCTATTAAAAGTGATTATTGTGGAGGTAATCCTT 
GCATAGGTGAGCACTGTTCGGGTCTTGCTATTAAAAGTGATTATTGTGGAGGTAATCCTT 
GCATAGGTGAGCACTGTTCTGGTCTTGCTATTAAAAGTGATTATTGTGGAGGCAATCCTT 
AACCAGGTCAGCATTGCCCTGGCTTGGGTCTTGTGGAGGATGATTGCTCTGGTAATCCTT 



**** **** ** * ** * * * ** 



*** **** ** ******* 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



GTACTTGCCAACCACAAGCATTTTTGGGTTGGTCTGTTGATTCTTGTTTACAAGGGGATA 

GTACTTGCCAACCACAAGCATTTTTGGGTTGGTCTGTTGACTCTTGTTTACAAGGGGATA 

GTACTTGCCAACCAAAAGCATTTTTGGGTTGGTCTGTGGACTCTTGTTTACAAGGGGATA 

GCACTTGTAAACCACAGGCTTTCATAGGCTGGAGTTCAGAA?\CTTGTTTGCAA?VATGGTA 
* ***** ***** * ** ** * ** *** * ** ******* *** * ** 
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BCVspike 
HCVspike 
CRCVspike 
HEVspike 



GGTGTAATATCTTTGCTAATTTTATTTTGCATGATGTTAATAGTGGTACTACTTGTTCTA 
GGTGTAATATTTTTGCTAATTTTATTTTGCATGATGTTAATAGTGGTACTACTTGTTCTA 
GGTGTAATATTTTTGCTAATTTTATTTTGCATGGTGTTAATAGTGGTACTACTTGTTCTA 
GGTGTAATATTTTTGCTAATTTTATTTTGAATGATGTTAATAGCGGTACTACCTGTTCTA 



********** 



****************** *** ********* ******** ******* 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



CTGATTTACAAAAATCAAACACAGACATAATTCTTGGTGTTTGTGTTAATTATGATCTTT 
CTGATTTACAAAAATCAAACACAGACATAATTCTTGGTGTTTGTGTTAATTATGATCTTT 
CTGATTTACAAAAATCAAACACAGACATAATTCTTGGTGTTTGTGTTAATTATGATCTTT 
C T GAT T T AC AAC AG G G T AAT AC T AAT AT TAG T AC T GAT GTTTGTGT T AAT TAT G AC C TAT 



********** 



* * ** ** * ** * * ** ****************** ** * 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



ATGGTATTACAGGCCAAGGTATTTTTGTTGAGGTTAATGCGACTTATTATAATAGTTGGC 
ATGGTATTACAGGCCAAGGTATTTTTGTTGAGGTTAATGCGCCTTATTATAATAGTTGGC 
ATGGTATTACAGGCCAAGGTATTTTTGTTGAGGTTAATGCGACTTATTATAATAGTTGGC 

AT G G CAT T AC AG G C C AG GG CAT AC T T AT AG AAG T T AAT G C C AC G T ATT AT AAT AGT T G G C 
**** *********** ** ** ** * ** ******** * **************** 

AG AAC CTTTTATATGATTC T AAT G G T AAT C T C TAT G G T T T T AGAG ACT AC T T AAC AAAC A 
AG AAC C T T T TAT AT GAT T C T AAT G G T AAT C T C T AT G G T T T TAG AG AC T AC T T AAC AAAC A 
AG AAC C T T T TAT AT GAT T C T AAT GGT AAT C T C TAT G G T T T TAG G G ACT ACT T AAC AAAC A 
AGAATCTTCTTTATGATTCTAGTGGT7\ATCTCTATGGCTTTAGAGATTATTTATCAAATA 
**** *** * ********** *************** ***** ** ** *** **** * 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



GAACTTTTATGATTCGTAGTTGCTATAGCGGTCGTGTTTCAGCGGCCTTTCATGCTAATT 
GAACTTTTATGATTCGTAGTTGCTATAGCGGTCGTGTTTCAGCGGCCTTTCATGCTAACT 
GAACTTTTATGATTCGTAGTTGCTATAGCGGTCGTGTTTCAGCGGGCTTTCACTCTAACT 
GAACCTTTCTTATTCGTAGCTGCTATAGTGGAAGAGTTTCAGCAGTCTTTCATGCTAACT 



**** *** * ** 



****** ******** ** * ******** * ****** **** * 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



CTTCCGAACCAGCATTGCTATTTCGGAATATTAAATGCAATTACGTTTTTAATAATACTC 
CTTCCGAACCAGCATTGCTATTTCGGAATATTAAATGCAGTTACGTTTTTAATT^ATACTC 
C T T C C GAAC C AG CAT T G C TAT T T CG G AAT AT T AAAT GC AAT T AC G T T T T T AAT AAT AC T C 
CTTCTGAACCAGCTTTGATGTTTCGTAATCTTAAATGCAGCCACGTTTTTAATTATACCA 



**** ****** 



** *** * ***** *** ********* *********** **** 
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TTTCACGACAGCTGCAACCTATTAACTATTTTGATAGTTATCTTGGTTGTGTTGTCAATG 
TTTCACGACAGCTGCAACCTATTAACTATTTTGATAGTTATCTTGGTTGTGTTGTCAATG 
TTTCACGACAGCTGCAACCTATTAACTATTTTGATAGTTATCTTGGTTGTGTTGTCAATG 
TTTTAAGACAAATACAGCTTGTTAATTATTTTGATAGTTACCTTGGTTGTGTTGTTAATG 
*** * **** * ** * * **** ************** ************** **** 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



CTGATAATAGTACTTCTAGTGCTGTTCAAACATGTGATCTCACAGTAGGTAGTGGTTACT 
CTGATAATAGTACTTCTAGTGTTGTTCAAACATGTGATCTCACAGTAGGTAGTGGTTACT 
CTGATAATAGTACTTCTAGTTCTGTTCAAACATGTGATCTCACAGTAGGTAGTGGTTACT 
CTTATAATAATACAGCTAGTGCTGTAAGTACTTGTGATTTAACCGTTGGTAGCGGCTATT 



** ****** * 



** ***** *** ** ****** * ** ** ***** ** ** * 



BCVspike GTGTGGATTACTCTACAAAAAGACGAAGTCGTAGAGCGATTACCACTGGTTATCGGTTTA 
HCVspike GTGTGGATTACTCTACAAAAAGACGAAGTCGTAGAGCGATTACCACTGGTTATCGGTTTA 
CRCVspike G G G G G G A T T AC T C T AC AC AAAG AC G AAG T C G T AG AAC GAT T AC C AC TGGTTATCGGTT T A 

HEVspike G T G T T GAT TAT GT T ACAGC AC T T AGAT C AC GT AGAT C T T T T AC T AC AG GT TAT C G CT T T A 

* * ***** **** * ** ****** * **** ** ******** **** 

BCVspike CTAATTTTGAGCCATTTACTGTTAATTCAGTAAATGATAGTTTAGAACCTGTAGGTGGTT 
HCVspike CTAATTTTGAGCCATTTACTGTTAATTCAGTAAATGATAGTTTAGAACCTGTAGGTGGTT 
CRCVspike CTAATTTTGAGCCATTTACTGTTAATCCAGTAAATGATAGTTTACACCCTGTAGGTGGTT 
HEVspike CTAATTTTGAACCATTTGCCGCTAATTTGGTAAATGATAGTATAGAACCTGTTGGTGGTT 
********** ****** * * **** ************ ** * ***** ******* 

BCVspike TGTATGAAATTCAAATACCTTCAGAGTTTACTATAGGTAATATGGAGGAGTTTATTCAAA 
HCVspike TGTATGAAATTCAAATACCTTCAGAGTTTACTATAGGTAATATGGAGGAGTTTATTCAAA 
CRCVspike T G TAT G AAAT T C AAAT ACC T T C AG AG T T T AC T AT AGG T AAT AT GGAGGAGT T TAT T C AAA 

HEVspike TGTATGAAATACAGATACCTTCAGAGTTTACCATTGGTAATTTAGAAGAATTCATTCAAA 
********** ***************** ** ****** * ** ** ** ******* 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



TAAGCTCTCCTAAAGTTACTATTGATTGTTCTGCTTTTGTCTGTGGTGATTATGCAGCAT 
CAAGCTCTCCTAAAGTTACTATTGATTGTTCTGCTTTTGTCTGTGGTGATTATGCAGCAT 
CAAGATCTCCTAAAGTTACTATTGATTGTCCTGTTTTTGTCTGTGGTGATTATGCAGCAT 
CGAGTTCCCCTAAGGTTACTATAGATTGTGCTACATTTGTTTGTGGTGACTATGCTGCAT 
** ** ***** ******** ****** ** ***** ******** ***** **** 
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BCVspike G T AAAT C AC AG T T G G T T GAAT AT GG T AG T T T C T G T G AC AAT AT T AAT G C TAT AC T C AC AG 

HCVspike G T AAAT C ACAG T T G G T T GAAT AT G G T AGC T T C T GT G AC AAT AT T AAT G C TAT AC T C ACAG 

CRCVspike G T AAAT CAC AG T T G G T T GAAT AT GG TAG T T T T T G T G AC AAT AT T AAT G C TAT AC T C AC AG 

HEVspike G T AG AC AAC AG T TAG C T GAG TAT G G TAG T T T T T G T G AG AAC AT T AAT G C TAT AC T CAT AG 

* ****** * *** ******** ** ***** ** **************** ** 

BCVspike AAG T AAA T G AA C T AC T T G AC AC T AC AC AG T T G C AAG TAG C T AAT AG T T T AAT GAAT G G T G 

HCVspike AAGTAAATGAACTACTTGACACTACACAGTTGCAAGTAGCTAATAGTTTAATGAATGGTG 
CRCVspike AAGTAAATGAACTACTTGACACTACACAGTTGCAAGTAGCTAATAGTTTAATGAATGGTG 
HEVspike AAG T AAAT GAAC T AC T T G AC AC T AC AC AG T T G C AAG TAG C T AAT AG T T T AAT GAAT G GAG 

********************************************************** * 

BCVspike TCACTCTTAGCACTAAGCTTAAAGATGGCGTTAATTTCAATGTAGACGACATCAATTTTT 
HCVspike T CAC T C T TAG CAC T AAG C T T AAAG AT GG C G T T AAT T T C AAT GT AGAC G ACAT C AAT T T T T 

CRCVspike T CAC T C T TAG CAC T AAG C T T AAAGAT G GC T T T AAT T T C AAT G T AGAT G AC AT C AAT T T T T 

HEVspike T CAC C C T T AG T AC T AAGAT T AAG G AT GG G AT T AAT T T C AAT G T T G AC GAT AT C AA C T T C T 

**** ***** ****** **** ***** ************* ** ** ***** ** * 

BCVspike C C C C T G TAT TAGG T T G T T TAG GAAG CG AT T G T AAT AAAGT T T C C AGT AG AT C T G C TAT AG 

HCVspike CCCCTGTATTAGGTTGTTTAGGAAGCGCTTGTAATAAAGTTTCCAGCAGATCTGCTATAG 
CRCVspike CCCCTGTATTAGGTTGTTTAGGAAGCGAATGTAATAAAGTTTCCAGTAGATCTGCTATAG 
HEVspike CCTCTGTATTAGGTTGTTTAGGAAGCGAATGTAACAGAGCTTCCACTAGATCTGCTATAG 
** ********************* *** ***** * ** ***** ************* 

BCVspike AGGATTTACTTTTTTCTAAAGTAAAGTTATCTGATGTCGGTTTTGTTGAGGCTTATAATA 
HCVspike AGGATTTACTTTTTTCTAAAGTAAAGTTATCTGATGTCGGTTTCGTTGAGGCTTATAATA 
CRCVspike AGGATTTACTTTTTTCTAAAGTAAAGTTATCTGATGTTGGTTTTGTTGATGCTTATAATA 
HEVspike AGGATTTACTTTTTGATAAAGTAAAATTGTCTGATGTCGGTTTTGTACAGGCCTATAATA 
************** ********* ** ******** ***** ** * ** ******* 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



ATTGTACTGGAGGTGCCGAAATTAGGGACCTCATTTGTGTGCAAAGTTATAATGGTATCA 
ATTGTACTGGAGGTGCCGAAATTAGGGACCTCATTTGTGTGCAAAGTTATAATGGTATCA 
ATTGTACTGGAGGTGCCGAAATTAGGGACCTCATTTGTGTGCAAAGTTATAATGGTATCA 
ACTGCACTGGAGGAGCCGAAATTAGGGATCTCATTTGTGTGCAAAGTTATAATGGTATCA 
* ** ******** ************** ******************************* 
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AAGTGTTGCCTCCACTACTCTCAGAAAATCAGATCAGTGGATACACTTTGGCTGCTACCT 
AAGTGTTGCCTCCACTGCTCTCAGTAAATCAGATCAGTGGATACACTTTGGCTGCCACCT 
AAGTGTTGCCTCCACTGCTCTCAGAAAATCAGATCAGTGGATACACTTTGGCTGCCACCT 
AAGTGTTGCCTCCATTGTTATCTGAAAATCAGATTAGTGGTTACACTTCGGCAGCCACCG 
************** * * ** * ********* ***** ******* *** ** *** 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



CTGCTAGTCTGTTTCCTCCTTGGTCAGCAGCAGCAGGCGTACCATTTTATTTAAATGTTC 
CTGCTAGTCTGTTTCCTCCTTGGTCAGCAGCAGCAGGTGTACCATTTTATTTAAATGTTC 
TTGCTAGTCTGTTTCCTCCTTGGTCAGCAGCAGCAGGCGTACCATTTTATTTAAATGTTC 
CTGCTAGCCTATTTCCTCCCTGGACAGCTGCAGCAGGTGTACCATTTTATTTAAATGTTC 



****** ** **** 



**** *** **** ******** ********************** 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



AGTATCGTATTAATGGGATTGGTGTTACCATGGATGTTCTAAGTCAAAATCAAAAGCTTA 
AGTATCGTATTAATGGGATTGGTGTTACCATGGATGTGTTAAGTCAAAATCAAAAGCTTA 
AGTATCGTATTAATGGTATTGGTGTTACCATGGATGTGCTAACTCAAAATCAAAAGCTTA 
AGTATCGTATAAATGGGCTTGGCGTCACCATGGATGTGCTAAGCCAAAACCAAAAGCTTA 
********** ***** **** ** *********** *** ***** ********** 

TTGCTAATGCATTTAACAATGCCCTTGATGCTATTCAGGAAGGGTTTGATGCTACCAATT 
TTGCTAATGCATTTAGCAATGCTCTTGATGCTATTCAGGAAGGGTTTGATGCTACCAATT 
TTTCTAATGCATTTAACAATGCCCTTGATGCTATTCAGGAAGGGTTTGATGCTACCAATT 
TTGCTAGTGCATTTAACAACGCTCTTGATTCTATCCAGG7y\GGGTTCGACGCAACCAATT 
** *** ******** *** **" ****** **** *********** ** ** ******* 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



CTGCTTTAGTTAAAATTCAAGCTGTTGTTAATGCAAATGCTGAAGCTCTTAATAACTTAT 
CTGCTTTAGTTAAAATTCAAGCTGTTGTTAATGCAAATGCTGAAGCTCTTAATAACTTAT 
CTGCTTTAGTTAAAATTCAAGCTGTTGTTAATGCAT^ATGCTGAAGCTCTTAATAACTTAT 
CTGCTTTAGTTAAAATTCAGGCTGTTGTTAATGCAAATGCTGAAGCACTTAATAACTTAT 



******* 



************ ************************** ************* 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 



TGCAACAACTCTCTAATAGATTTGGTGCTAT7\AGTTCTTCTTTACAAGAAATTCTATCTA 
TGCAACAACTCTCTAATAGATTTGGTGCTATAGGTTCTTCTTTACAAGAAATTCTATCTA 
TGCRACAACTCTCTAATAAATTTGGTGCTATAAGTGCTTCTTTACAAGT^AATTCTATCTA 
TGCAGCAACTCTCTAACAGATTTGGTGCCATAAGTGCCTCTTTACAAGAAATTTTATCCA 
*** *********** * ********* *** ** * *************** **** * 



WO 2004/011651 



BCVspike 
HCVspike 
CRCVspike 
HEVspike 

BCVspike 
HCVspike 
CRCVspike 
HEVspike 

BCVspike 
HCVspike 
CRCVspike 
HEVspike 

BCVspike 
HCVspike 
CRCVspike 
HEVspike 

BCVspike 
HCVspike 
CRCVspike 
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G AC T T GAT G C T C T T G AAG C G C AAGC T C AG AT AG AC AGAC T TAT T AAT GGGCGTCTT ACC G 
GAC T GG AT GC T C T T GAAGCGC AAG C T C AG AT AG ACAGAC T TAT T AAT GGGCGTCT T ACC G 
GACTTGATGCTCTTGAAGCGCAAGCTCAGATAGACAGACTTATCAATGGGCGTCTTACCG 
GGCTCGATGCTCTTGAAGCTAAAGCTCAGATAGACAGACTTATTAATGGGCGTCTCACCG 
* ** ************** ********************** *********** **** 

CTCTTAATGCTTATGTTTCTCAACAGCTTAGTGATTCTACACTAGTAAAATTTAGTGCAG 
CTCTTAATGCTTATGTTTCTCAACAGCTTAGTGATTCTACACTAGTAAAATTTAGTGCAG 
CTCTTAATGCTTATGTTTCTCAACAGCTTAGTGATTCTACACTAGTAAAATTTAGTGCAG 
CTCTTAATGCTTATGTTTCTCAGCAGCTTAGTGATTCTACACTAGT^AAATTTAGTGCAG 
********************** ************************************* 

C AC AAG C TAT G G AG AAGG T T AAT G AAT G T G T C AAAAG C C AAT CAT C T AGG AT AAAT T T T T 
C AC AAG CT AT GGAGAAGG T T AAT GAAT GT GTC AAAAG C C AATC AT C T AGG AT AAAT T T T T 
C AC AAG C TAT G G AG AAGGT T AAT G AAT G T GT C AAAAG C C AA T CAT C TAG GAT AAAT T T T T 
C ACAAG C T AT TG AGAAAG T T AAT GAAT G T G T T AAAAGC C AAT CAT CT AG GAT AAAT T T C T 
********** ***** ************** ************************** * 

GTGGTAATGGT7VATCATATTATATCATTAGTGCAGAATGCTCCATATGGTTTGTATTTTA 
GTGGTAATGGTAATCATATTATATCATTAGTGCAGAATGCTCCATATGGTTTGTATTTTA 
GTGGTAATGGTAATCATATTATATCATTAGTGCAGAATGCTCCATATGGTTTGTATTTTA 
G T G GT AAT G GT AAT CAT AT TAT AT CAT TAG T ACAG AAT G C T CCAT AT G G T T T G T AT T T T A 
******************************* **************************** 

TCCACTTTAGCTATGTCCCTACTAAGTATGTCACTGCGAAGGTTAGTCCCGGTCTGTGCA 

TCCACTTTAGCTATGTCCCTACTAAGTATGTCACTGCGAAGGTTAGTCCCGGTCTGTGCA 

TCCACTTTAGCTATGTCCCTACTAAGTATGTCACTGCGAAGGTTAGTCCCGGTCTGTGCA 

TCCATTTTAGCTATGTCCCCACCAAGTATGTTACAGCAAAGGTTAGTCCTGGTTTGTGCA 
**** ******** ****** ** ******** ** ** *********** *** ****** 

TTGCTGGTGATAGAGGTATAGCCCCTAAGAGTGGTTATTTTGTTAATGTAAATAACACTT 
TTGCTGGTGATAGAGGTATAGCCCCTAAGAGTGGTTATTTTGTTAATGTAAATAATACTT 
T Y G C AG GT GAT AG AG G TAT AG C T C C T AAG AG T G G T TAT T T T G T T AAT G T AAAT AAC AC T T 
TTGCTGGC GAT AT AG GAAT AT C GC C T AAG AG T GG T TAT T T TAT T AAT G T AAAT AAC T C T T 
* ** ** **** *** *** * ****************** ************* *** 
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GGATGTTCACTGGTAGTGGTTATTACTACCCTGAACCTATAACTGGAAATAATGTTGTTG 
GGATGTTCACTGGTAGTGGTTATTACTACCCTGAACCCATAACTGGAAATAATGTTGTTG 
GGATGTTCACTGGTAGTGGTTATTACTACCCTGAACCTATAACTGGAAATAATGTGGTTG 
GGATGTTCACTGGTAGTGGCTATTACTACCCTGAACCTATAACCCAAAATAATGTTGTTG 
******************* ***************** ***** ********* **** 

TTATGAGTACCTGTGCTGTTAATTACACTAAAGCACCGGATGTAATGCTGAACATTTCAA 
TTATGAGTACCTGTGCTGTTAACTATACTAAAGCGCCGGATGTAATGCTGAACATTTCAA 
TTATGAGTACCTGTGCTGTTAACTATACTAAAGCACCGGATGTAATGCTGAACATTTCAA 

T GAT GAGT AC GTGTGCTGT T AAT TAT AC T AAAG CACC G GAT C T AAT G C T GAAC AC AT C G A 
* ******** *********** ** ******** ****** ************ ** * 

CACC C AAC C T C C C T GAT T T T AAG G AAG AG T T G GAT C AAT G G T T T AAAAAC C AAAC AT C AG 
C AC C C AAC C T C CAT GAT T T T AAG G AAG AG T T G GAT C AAT GGT T T AAAAAC C AAAC AT CAG 
C AC C C AAC C T C C C T GAT T T T AAG G AAG AG T T G GAT C AAT G G T T T AAAAAC C AAAC AT T AA 
CACCCAACCTTCCTGATTTCAAGGAAGAATTGTATCAATGGTTTAAAAACCAATCTTCAT 
********** * ****** ******** *** ******************** * * * 

T G GC AC CAGAT T T G T C AC T T GAT T AT AT AAATGT T AC AT T C T T GG ACC T AC AAG AT G AAA 
T G GC AC CAG AT T T G T C AC T T GAT T AT AT AAAT G T T AC AT T C T T G G AC C T AC AAG AT G AAA 
T G G C AC CAGAT T T G T C AC T T GAT T AT AT AAAT G T T AC AT T C T T G G AC C T AC AAG AT G AAA 
TGGCACCAGATTTGTCATTTGATTATATTAATGTTACGTTCTTGGACCTACAAGATGAAA 
***************** ********** ******** ********************** 

T G AAT AG G T T AC AGG AGG C AAT AAAAG T T T T AAAT C AG AGC T AC AT C AAT C T C AAG G AC A 

T GAAT AGGT T ACAGGAGGC AATAAAAG T T T T AAAT C AGAGCT AC AT CAAT CT C AAGGAC A 

T G AAT AG G T T AC AG G AG G C AAT AAAAG T T T T AAA T CAT AG C T AC AT C AAT C T C AAG G AC A 

T GAAT AG G T T AC AAG AAG C TAT AAAAG T T C T AAAT CAT AG C T AC AT CAAT C T C AAG G AC A 
* * * * * * * * * ** ** ** ** ** * * * * * ******* ********************** 
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TTGGTACATATGAGTATTATGTAAAATGGCCTTGGTATGTATGGCTTTTAATTGGCCTTG 
TTGGTACATATGAGTATTATGT7y\AATGGCCTTGGTATGTATGGCTTTTAATTGGCTTTG 
TTGGTACATATGAATATTATGT7\AAATGGCCTTGGTATGTATGGCTTTTAATTGGCCTTG 
TTGGTACATATGAGTATTATGTGAAATGGCCTTGGTATGTATGGCTTTTAATTTGCCTTG 
************* ******** ****************************** ** *** 
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CTGGTGTAGCTATGCTTGTTTTACTATTCTTCATATGCTGTTGTACAGGATGTGGGACTA 

CTGGTGTAGCTATGCTTGTTTTACTATTCTTCATATGCTGTTGTACAGGATGTGGGACTA 

CTGGCGTAGCTATGCTTGTTTTACTATTCTTCATATGCTGTTGTACAGGATGTGGGACTA 

CTGGTGTAGTTATGCTTGTTTTACTATTCTTCATATGCTGCTGTACAGGATGTGGGACTA 
**** -k-kk-k ******************* 

GTTGTTTTAAGAAATGTGGTGGTTGTTGTGATGATTATAC 

GT T GT T T T AAGAT AT GTGGTGGTTGTTGT GAT GAT TAT AC T G G AC AC C AG G 

GTTGTTTTAAGAAATGCGGTGGTTGTTGTGATGATTATACTGGACATCAGG — 

GTTGTTTTAAGAAATGTGGCGGTTGTTTTGATGATTATACTGGACACCAGGAGTTTGTAA 

************ * * * k k ******* ************ 
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T C AAAAC T T C AC AT G ACG AT T AAT T T C G T 
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MFLILLISLPMALAVIGDLKCTTVSINDVDTGVPSVSTDTVDVTNGLGTYYVLDRV 

MFLILLISLPTAFAVIGDLKCTTVSINDIDTGAPSISTDIVDVTNGLGTYYVLDRV 

MFLILLISLPMAFAVIGDLKCTXVSINDVDTGAPSISTDVVDVTNGLGTYYVLDRV 

MFFILLITLPSVFAVIGDLKCNTSSINDVDTGVPSISSEVVDVTNGLGTFYVLDRV 

MI VLVTC I LLLCS YHTAS S T SNN DCRQVNVTQLDGNENLI RDFLFQN FKEEGT VVVGG — 

YLNTTLLLNGYYPTSGSTYRNMALKGTLLLSTLWFKPPFLSDFINGIFAKVKNTECVIKNG 
YLNTTLLLNGYYPTSGSTYRNMALKGTLLLSRLWFKPPFLSDFINGIFAKVKNTKVIKKG 
YLNTTLLLNGYYPTSGSTYRNMALKGTLLLSTLWFKPPFLSDFIDGVFAKVKNTKVIKDG 
YLNTTLLLNGYYPISGATFRNVALKGTRLLSTLWFKPPFLSPFNDGIFAKVKNSRFSKHG 
YYPTEVWYNCSRTATTTAYEYFSNIHAFYFDMEAMENSTGNARGKPLLFHVHGEPVS — V 
* * : * : :. » 

VMYSEFPAITIGSTFVNTSYSVVVQPHTTNLDNKLQGLLEISVCQYTMCEYPHTICHPNL 
VMYSEFPAITIGSTFVNTSYSVWQPHTTNLDNKLQGLLEISVCQYTMCEYPHTICHPNL 
VVYSEFPAITIGSTFVNTSYSVVVQPHTTNLDNKLQGLLEISVCQYTMCDYPHTMCHPNL 
VIYSEFPAITIGSTFVNTSYSIWKPHTSFINGNLQGFLQISVCQYTMCEYPQTICHPNL 
I I YI S YRDDVQHRPLLKHGLVCITESRNI DYN-SFTSSQWNS ICTGNDRKI PFSVI PTDN 

• * • ♦ • k *k • » * 

«. v # • • • * a • ft *•**•* • • • • • • * • * ■> * 

GNRRI ELWHWDTGVVS CL YKRN FT YDVN ADYLYFHFYQEGGTFYAYFTDTGVVT 

GNRRVELWHWDTGVVSCLYKRNFTYDVN — ADYLYFHFYQEGGTFYAYFTDTGVVT 

GNKRIELWHWDTGVVPCL YKRN FT YDVN AD YL Y S H F YQEGGT F YAY FT DT G VVT 

GNQRI ELWHHDTDVVSCLYRRNFT YDVN ADYLYFHFYQEGGTFYAYFTDTGFVT 

GTKIYGLEWNDEFVTAYISGRSYNWNINNNWFNNVTLLYSRSSTATWQHSAAYVYQGVSN 

k • k k k • k k » k k 

KFLFNVYLGTVLSHYYVMP LTCNSAMTLEYWVTPLTSKQYLLAFNQDGVIF 

KFLFNVYLGTVLSHYYVLP LTCNSAMTLEYWVTPLTSKQYLLAFNQDGVI F 

KFLFHVYLGTVLSHYYVMP LTCNSAMTLEYWVT PLTFKQYLLAFNQDGVI F 

KFLFKLYLGTVLSHYYVMP LTCDSALSLEYWVTPLTTRQFLLAFDQDGVLY 

FTYYKLNNTNGLKTYELCEDYEYCTGYATNIFAPTVGGYIPDGFSFNNWFLLTNSSTFVS 

. . . k k • » k ...••■*•» • 
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NAVDCKSDFMSEIKCKTLSIAPSTGVYELNGYTVQPIADVYRR-IPNLP.DCNIEAWLNDK 
NAVDCKSDFMSEIKCKTLSIAPSTGVYELNGYTVQPIADVYRR-IPNLPDCNIEAWLNDK 
NAVDCKSDFMSEIKCKTLSIAPSTGVYELNGYTVQPIADVYRR-IPNLPDCNIEAWLNDK 
HAVDCASDFMSEIMCKTSSITPPTGVYELNGYTVQPVATVYRR-IPDLPNCDIEAWLNSK 
GRFVTNQPLLVNCLWPVPSFGVAAQEFCFEGAQFSQCNGVFLNNTVDVIRFNLNFTADVQ 

• . • • • . * • ■ • • • . * • • • * • • 

SVPSPLNWERKTFSNCNFNMSSLMSFIQADSFTCN NIDAAKI YGMCFSSITIDK 

SVPSPLNWERKTFSNCNFNMSSLMSFIQADSFTCN NIDAAKI YGMCFSSITIDK 

SVPSPLNWERKTFSNCNFNMSSLMSFIQADSFTCN NIDAAKIYGMCFFSITIDK 

TVS S PLNWERKI FSNCNFNMGRLMS FI QADS FGCN N I DAS RL YGMC FG S I T I DK 

SGMGATVFSLNTTGGCILEISCYNDIVSESSFYSYGEIPFGVTDGPRYCYVLYNGTALKY 

■ - • • • & ~k » - » »» 

FAIPNGRKVDLQLGNLGYLQSFNYRIDTTATSCQLYYNLPAAN-VSVSRFNPSTWNRRFG 
FAIPNGRKVDLQLGNLGYLQSFNYRIDTTATSCQLYYNLPAAN-VSVSRFNPSTWNRRFG 
FAI PNGRKVDLQMGNLGYLQSFNYRI DTTATSCQLYYNLPASN-VSI SRFNPSIWNRRFG 
FAIPNSRKVDLQVGKSGYLQSFNYKIDTAVSSCQLYYSLPAAN-VSVTHYNPSSWNRRYG 
FGTLPPSVKEIAISKWGQFYINGYNFFSTFPIDCISFNLTTGDSGAFWTIAYTSYTEALV 

■> - * • *}e • 9f • * • • • tJc » ■ * »• 

FTEQSVFKPQPVGVFTDHDVVYAQHCFKAPTNFCPCKLDGSLCVGSGSGIDAGYKNSGIG 
FTEQSVFKPQPVGVFTHHDVVYAQHCFKAPTNFCPCKLDGSLCVGNGPGIDAGYKNSGIG 
FTEQSVFKPQPVGVFTDHDVVYAQHCFKAPTNFCPCKLNGSLCVGSGFGIDAGYKNSGIG 

FINQS FGSRGLHDAVYSQQCFNTPNTYCPCRT — SQCIGG AGTG 

QVENTAI KKVT YCN SHI NNI KC SQLTANLQNGFYPVAS SEVGLVNKS VVLL PS F YS HTS V 

... • • • *k • • • 

... ... ... w • 

TCPAGTNYLTCH NAAQCNCLCTPDPITSKSTGPYKCPQTKYLVGIGEHCSGLAIKS 

TCPAGTNYLTCH NAAQCDCLCTPDPITSKSTGPYKCPQTKYLVGIGEHCSGLAIKS 

TCPAGTNYLTCY NANQCDCLCTPDPILSKSTGPYKCPQTKYLVGIGEHCSGLAIKS 

TCPVGTTVRKCFAAVTNATKCTCWCQPDPSTYKGVNAWTCPQSKVSIQPGQHCPGLGLVE 
N I T I DLGMKRS G YG — QPIASTLSNITLPMQDNNTDVYCIRSNQFSVYVHSTCKSSLWDN 

• "k .... -k 
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DYCGGNPCTCQPQAFLGWSVDSCLQGDRCN — IFANFILHDVNSGTTCSTDLQKSNTDII 
DYCGGNPCTCQPQAFLGWSVDSCLQGDRCN — IFANFILHDVNSGTTCSTDLQKSNTDII 
DYCGGNPCTCQPKAFLGWSVDSCLQGDRCN — IFANFILHGVNSGTTCSTDLQKSNTDII 
DDCSGNPCTCKPQAFIGWSSETCLQNGRCN — IFANFILNDVNSGTTCSTDLQQGNTNIT 
NFNQDCTDVLYATAVIKTGTCPFSFDKLNNYLTFNKLCLSLNPTGANCKFDVAARTRTNE 
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LGVCVNYDLYGITGQGIFVEVNATYYNSWQNLLYDSNGNLYGFRDYLTNRTFMIRSCYSG 
LGVCVNYDLYGITGQGIFVEVNAPYYNSWQNLLYDSNGNLYGFRDYLTNRTFMIRSCYSG 
LGVCVNYDLYGITGQGIFVEVNATYYNSWQNLLYDSNGNLYGFRDYLTNRTFMIRSCYSG 
TDVCVNYDLYGITGQGILIEVNATYYNSWQNLLYDSSGNLYGFRDYLSNRTFLIRSCYSG 
QVVRSLYVIYEEGDNIVGVPSDNSGLHDLSVLHLDSCTDYN IYGRTGVGIIRQTNST 

RVS AAFH AN S S E PALL FRN I KCN Y VFNNTL S RQLQP I N Y FD S YLGC V VNADN S T S S AVQT 
RVSAAFHANSSEPALLFRNIKCSYVFNNTLSRQLQPINYFDSYLGCVVNADNSTSSVVQT 
RVSAGFHSNSSEPALLFRNIKCNYVFNNTLSRQLQPINYFDSYLGCVVNADNSTSSSVQT 
RVSAVFHANSSEPALMFRNLKCSHVFNYTILRQIQLVNYFDSYLGCVVNAYNNTASAVST 
ILSGLHYTSLSGDLLGFKNVSDGVVYSVTPCDVSAQAAVIDGAIVGAMTSINSELLGLTH 

: * . : : 

M / S ^ 

CDLTVGSGYCVDYSTKRRSRRAITTGYRFTNFEPFTVNS VNDS 

CDLTVGSGYCVDYSTKRRSRRAITTGYRFTNFEPFTVNS VNDS 

CDLTVGSGYWGDYSTQRRSRRTITTGYRFTNFEPFTVNP VNDS 

CDLTVGSGYCVDYVTALRSRRSFTTGYRFTNFEPFAANL VNDS 

WTTTPNFYYYSIYNTTNERTRGTAIDSNDVDCEPIITYSNIGVCKNGALVFINVTHSDGD 
* * **.*:...:**:. : • . 

in , ^wb w^w* 

LEPVGGLYEIQI PSEFTI GNMEEFIQI SS PKVT I DCSAFVCGDYAACKSQLVEYGSFCDN 
LEPVGGLYEIQIPSEFTIGNMEEFIQTSSPKVTIDCSAFVCGDYAACKSQLVEYGSFCDN 
LHPVGGLYEIQIPSEFTIGNMEEFIQTRSPKVTIDCPVFVCGDYAACKSQLVEYGSFCDN 
IEPVGGLYEIQIPSEFTIGNLEEFIQTSSPKVTIDCATFVCGDYAACRQQLAEYGSFCEN 
VQPIS-TGNVTIPTNFTISVQVEYIQVYTTPVSIDCSRYVCNGNPRCNKLLTQYVSACQT 
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€87 

INAILTEVNELLDTTQLQVANSLMNGVTLSTKLKDGVNFNVDDIN FSPVL 

INAILTEVNELLDTTQLQVANSLMNGVTLSTKLKDGVNFNVDDIN FSPVL 

INAILTEVNELLDTTQLQVANSLMNGVTLSTKLKDGFNFNVDDIN FSPVL 

INAILIEVNELLDTTQLQVANSLMNGVTLSTKIKDGINFNVDDIN FSSVL 

IEQALAMSASLENMEVDSMLFVSENALKLASVEAFNSTEHLDPIYKEWPNIGGSWLGGLK 
*:* . * : . : *.:.*:: ..::** 

GCLGSDCNKVSSRSAIEDLLFSKVKLSDVG-FVEAYNNCTGGAEIRDLICVQSYNGIKVL 
GCLGSACNKVSSRSAIEDLLFSKVKLSDVG-FVEAYNNCTGGAEIRDLICVQSYNGIKVL 
GCLGSECNKVSSRSAIEDLLFSKVKLSDVG-FVDAYNNCTGGAEIRDLICVQSYNGIKVL 
GCLGSECNRASTRSAIEDLLFDKVKLSDVG-FVQAYNNCTGGAEIRDLICVQSYNGIKVL 
DILPSHNSKRKYRSAIEDLLFDKVVTSGLGTVDEDYKRCTGGYDIADLVCAQYYNGIMVL 
* * . *********^** . : : * ** : *^* **** ** 

977 /o/g 
f 



r i 

PPLLSENQISGYTLAATSASLFPPWS-AAAGVPFYLNVQYRINGIGVTMDVLSQNQKLIA 
PPLLSVNQISGYTLAATSASLFPPWS-AAAGVPFYLNVQYRINGIGVTMDVLSQNQKLIA 
PPLLSENQISGYTLAATFASLFPPWS-AAAGVPFYLNVQYRINGIGVTMDVLTQNQKLIS 
PPLLSENQISGYTSAATAASLFPPWT-AAAGVPFYLNVQYRINGLGVTMDVLSQNQKLIA 
PGVANDDKMTMYTASLAGGIALGALGGGAVAIPFAVAVQARLNYVALQTDVLNKNQQILA 

* ..*.**.. . * 

»..,». ... • .... . . ... • » .... 

/06S 
\ 

SALVKI QAVVNANAEALNNLLQQLS NRF 

SALVKI QAVVNANAEALNNLLQQLSNRF 

SALVKI QAVVNANAEALNNLLQQLSNRF 

SALVKI QAVVNANAEALNNLLQQLSNRF 



NAFNNALDAIQEGFDATN 

NAFSNALDAIQEGFDATN 

NAFNNALDAIQEGFDATN — — 

SAFNNALDSIQEGFDATN 

NAFNQAIGNITQAFGKVNDAIHQTSQGLATVAKALAKVQDVVNTQGQALSHLTVQLQNSF 

* * . * . * * * * ^ * * ^ * • * * * * . • ^ * * * ^ • * * * ^ * * 
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GAISSSLQEILSRLDALEAQAQIDRLINGRLTALNAYVSQQLSDSTLVKFSAAQAMEKVN 
GAIGSSLQEILSRLDALEAQAQIDRLINGRLTALNAYVSQQLSDSTLVKFSAAQAMEKVN 
GAISASLQEILSRLDALEAQAQIDRLINGRLTALNAYVSQQLSDSTLVKFSAAQAMEKVN 
GAISASLQEILSRLDALEAKAQIDRLINGRLTALNAYVSQQLSDSTLVKFSAAQAIEKVN 
QAISSSISDIYNRLDELSADAQVDRLITGRLTALNAFVSQTLTRQAEVRASRQLAKDKVN 
■k* . -v . . -k *** * # * ^ ** - * *** m ******* * : -k-k-k * - mZ * : * * : *** 
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ECVKSQSSRINFCGNGNHIISLVQNAPYGLYFIHFSYVPTKYVTAKVSPGLCIAGDRGIA 

ECVKSQSSRINFCGNGNHIISLVQNAPYGLYFIHFSYVPTKYVTAKVSPGLCIAGDRGIA 

ECVKSQSSRINFCGNGNHIISLVQNAPYGLYFIHFSYVPTKYVTAKVSPGLCIAGDRGIA 

ECVKSQS SRINFCGNGNHI I SLVQNAPYGLYFIHFS YVPTKYVTAKVS PGLCI AGDI GI S 

ECVRSQSQRFGFCGNGTHLFSLANAAPNGMVFFHTVLLPTAYETVTAWSGICASDGDRTF 
^c** : *** ** ***** . ** * . * . * : ** * * t t . * : * :. 

PKSGYFVNVNNTWMFTGSGYYYPEPITGNNVVVMSTCAVNYTKAPDVMLNISTP 

PKSGYFVNVNNTWMFTGSGYYYPEPITGNNVVVMSTCAVNYTKAPDVMLNISTP 

PKSGYFVNVNNTWMFTGSGYYYPEPITGNNVVVMSTCAVNYTKAPDVMLNISTP 

PKSGYFINVNNSWMFTGSGYYYPEPITQNNVVVMSTCAVNYTKAPDLMLNTSTP 

GLVVKDVQLTLFRNLDDKFYLTPRTMYQPRAATSSDFVQIEGCDVLFVNATVIDLPSIIP 



NLPDFKEELDQWFKNQTS — VAPDLSLDYINVTFLDLQDEMN- 
NLHDFKEELDQWFKNQTS — VAPDLSLDYINVTFLDLQDEMN- 
NLPDFKEELDQWFKNQTL — MAPDLSLDYINVT FLDLQDEMN- 
NLPDFKEELYQWFKNQSS — LAPDLSFDYINVTFLDLQDEMN- 



— RLQE 

RLQE 

RLQE 

RLQE 

DYIDINQTVQDILENYRPNWTVPELTIDIFNATYLNLTGEIDDLEFRSEKLHNTTVELAI 
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AIKVLNQSYINLKDIGTYEYYVKWPWYVWLLIGLAGVAMLVLLFFICCCTGCGTSCFKKC 
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AIKVLNHSYINLKDIGTYEYYVKWPWYVWLLIGLAGVAMLVLLFFICCCTGCGTSCFKKC 
AIKVLNHSYINLKDIGTYEYYVKWPWYVWLLICLAGVVMLVLLFFICCCTGCGTSCFKKC 
LIDNINNTLVNLEWLNRIETYVKWPWYVWLLIGLVVVFCIPLLLFCCCSTGCCG-CIGCL 
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GGCCDDYTGHQELVIKTSHDD 

GGCCDDYTGHQELVIKTSHDD 

GGCCDDYTGHQELVIKTSHDD 

GGCFDDYTGHQEFVIKTSHDD 

GSCCHSICSRRQFENYEPIEKVHVH- 
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FIGURE 11 
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FIGURE 12 
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TATCGCAGCC TTACTTTTGT 
GCTCTTTGTA AATCTGGTAG 
AATTTTGGGG AT TAT TAT T A 
TATATCGTAC CACTTTGTAT 
GATAGTCAAT AT TAT T T T AA 
ACCATTAACA CTGGTTTTGA 
TTAGCCATTT C AAAT GAG C T 
CGTAAGGATT TTACGCCTGT 
GAT AAC AT G A CGGCGG 



FIGURE 13 

TAATGTACCA TAT GTT TATA 
TTTAGTTCTT AATAACCCTG 
TAAGGTTGAA GCTGATTTCT 
TTTTAACGGC AAGTTTTTGT 
TAAAGACACT GGTGTTATTT 
TTTTAATTGT CAT TAT TT AC 
ATTGTTAACT GTTCCTACGA 
ACAGGTTGTT GACTCGCGGT 



ATGGCTCTGC ACAATCTACA 60 

C AT AT AT AG C TCGTGAAGCT 120 

ATTTGTCAGG TTGTGACGAG 180 

CGAATACAAA GTATTATGAT 2 40 

ATGGTTTCAA TTCTACTGAA 300 

TTTTACCCTC TGGTAATTAT 3 60 

AAGCAATCTG TCTTAATAAG 420 

GGAACAAT GC CAGGCAGTCT 4 80 

497 



FIGURE 14 

YRSLTFVNVP YVYNGSAQST ALCKSGSLVL NN PAY I AREA NFGDYYYKVE ADFYLSGCDE 60 

YIVPLCIFNG KFLSNTKYYD DSQYYFNKDT GVIYGFNSTE TINTGFDFNC HYLL.L.PSGNY 120 

LAI SNELLLT VPTKAICLNK RKDFTPVQVV DSRWNNARQS DNMTA 165 
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FIGURE 15 (Page 1 of 2) 

CRCV TATCGCAGCCTTACTTTTGTTAATGTACCATATGTTTATAATGGCTCTGCACAATCTACA 
BCV TATCGCAGCCTTACTTTTGTTAATGTACCATATGTTTATAATGGCTCTGCACAATCTACA 
OC43 TATCGCAGCCTTACTTTTGTTAATGTACCATATGTTTATAATGGCTCTGCACAATCTACA 
HECV TATCGCAGCCTTACTTTTGTTAATGTACCATATGTTTACAATGGCTCTGCACAATCTACA 
HEV TATCGCAGTCTTACTTTAGTTAATGTGCCATACGTTTACAATGGGTCAGCTCAACCCACC 
******** ******** ******** ***** ***** ***** ** ** *** * ** 

CRCV GCTCTTTGTAAATCTGGTAGTTTAGTTCTTAATAACCCTGCATATATAGCTCGTGAAGCT 
BCV QCTCTTTGTAAATCTGGTAGTTTAGTTCTTAATAACCCTGCATATATAGCTCGTGAAGCT 
OC43 GCTCTTTGTAAATCTGGTAGTTTAGTCCTTAATAACCCTGCATATATAGCTCCTCAAGCT 
HECV GCTCTTTGTAAATCTGGTAGTTTAGTTCTTAATAACCCTGCATATATAGCTCGTGAAGCT 
HEV GCACTTTGTAAGTCTGGCAGTTTAATTCTTAACAATCCTGCATATATAGCCCGTGAGGCT 
** ******** ***** ****** * ***** ** ************** * * * *** 

CRCV AATTTTGGGGATTATTATTATAAGGTTGAAGCTGATTTCTATTTGTCAGGTTGTGACGAG 
BCV AATTTTGGGGATTATTATTATAAGGTTGAAGCTGATTTTTATTTGTCAGGTTGTGACGAG 
OC43 AACTCTGGGGATTATTATTATAAGGTTGAAGCTGATTTTTATTTGTCAGGTTGTGACGAG 
HECV AATTTTGGGGATTATTATTATAAGGTTGAAGCTGATTTTTATTTGTCAGGTTGTGACGAG 
HEV AATGTGGGTGATTATTATTATAAGTCTGAAGCAGATTTTTCTCTCTCAGGTTGTGACGAG 
** ** *************** ****** ***** * * * *************** 

CRCV TATATCGTACCACTTTGTATTTTTAACGGCAAGTTTTTGTCGAATACAAAGTATTATGAT 
BCV TAT AT C GT AC CAC T T T GT AT T T T T AAC GGC AAGT T T T T GT C GAAT AC AAAGT AT TAT GAT 

OC43 TATATCGTACCACTTTGTATTTTTAACGGCAAGTTTTTGTCGAATACAAAGTATTATGAT 
HECV TAT AT C GT AC CACT T T GT AT T T T T AAC GGCAAGT T T T T GT CGAAT AC AAAG T AT TAT GAT 
HEV TAT AT C G T AC CAC TTTGTATTTT T AAT G G C AAG TTTTTGTC GAAT AC AAAGT AT TAT GAT 

************************** ********************************* 

CRCV GATAGTCAATATTATTTTAATAAAGACACTGGTGTTATTTATGGTTTCAATTCTACTGAA 

BCV GATAGTCAATATTATTTTAATAAAGACACTGGTGTTATTTATGGTCTCAATTCTACTGAA 

OC43 GAT AGT C AAT AT TAT T T T AAT AAAG AC AC T G GT GT T AT T T AT GGT C T C AAT T CT AC AG AA 

HECV GATAGTCAATATTATTTTAATAAAGACACTGGTGTTATTTATGGTCTCAATTCTACTGAA 

HEV G AT AGT CAAT AT TAT T T T AAT AAAG AC AC T GGT GT TAT T TAT GGT CT C AAT T C T AC T GAA 

******** ************************************* ********** *** 

CRCV ACCATTAACACTGGTTTTGATTTTAATTGTCATTATTTACTTTTACCCTCTGGTAATTAT 
BCV ACCATTACCACTGGTTTTGATTTTAATTGTCATTATTTAGTTTTACCCTCTGGTAATTAT 
OC43 ACCATTACCACTGGTTTTGATCTTAATTGTTATTATTTAGTTTTACCCTCTGGTAATTAT 
HECV ACCATTACCACTGGTTTTGATTTTAATTGTCATTATTTAGTTCTACCCTCTGGCAATTAT 
HEV ACCATTACCACTGGTTTTGATTTTAATTGTCATTATTTAGTTCTACCCTCTGGTAATTAT 
******* ************* ******** ******** ** ********** ****** 

CRCV TTAGCCATTTCAAATGAGCTATTGTTAACTGTTCCTACGAAAGCAATCTGTCTTZVATAAG 

BCV T T A G C CAT T T C AAAT GAG C TAT T G T T AAC T G T T C C T AC G AAAG CAAT C T G T C T T AAT AAG 

OC4 3 TTAGCCATTTCAAATGAGCTATTGTTAACTGTTCCTACGAAAGCAATCTGTCTTAATAAG 

HECV TTAGCCATTTC7\AATGAGCTATTGTTAACTGTTCCTACTAAAGCAATCTGTCTTAATAAG 

HEV CTAGCCATTTCAAATGAGCTATTGTTAACTGTTCCTACTAAAGCAATCTGTCTTAATAAG 
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CRCV CGTAAGGATTTTACGCCTGTACAGGTTGTTGACTCGCGGTGGAACAATGCCAGGCAGTCT 
BCV CGTAAGGATTTTACGCCTGTACAGGTTGTTGACTCTCGGTGGAACAATGCCAGGCAGTCT 
OC43 CGTAAGGATTTTACGCCTGTACAGGTTGTTGATTCGCGGTGGAACAATGCCAGGCAGTCT 
HECV CGTAAGGATTTTACGCCTGTACAGGTTGTTGACTCGCGGTGGAACAATGCCAGGCAGTCT 
HEV CGTAAGGTTTTTACGCCTGTACAGGTTGTTGATTCGCGGTGGAACAATGCCAGGCAATCT 
******* ************************ ** ******************** *** 

CRCV GATAACATGACGGCGGT 

BCV GATAACATGACGGCGGT 

OC 4 3 GATAACATGACGGCGGT 

HECV G AT AAC AT G AC G G C AG T 

HEV GAT AACAT GACGGCAGT 
************** * * 



FIGURE 16 

CRCV YRSLTFVNVPYVYNGSAQSTALCKSGSLVLNNPAYIAREANFGDYYYKVEADFYLSGCDE 
BCV YRSLTFVNVPYVYNGSAQSTALCKSGSLVLNNPAYIAREANFGDYYYKVEADFYLSGCDE 
OC4 3 YRSLTFVNVPYVYNGSAQSTALCKSGSLVLNNPAYIAPQANSGDYYYKVEADFYLSGCDE 
HECV YRSLTFVNVPYVYNGSAQSTALCKSGSLVLNNPAYIAREANFGDYYYKVEADFYLSGCDE 
HEV YRSLTLVNVPYVYNGSAQPTALCKSGSLILNNPAYIAREANVGDYYYKSEADFSLSGCDE 
****************** ^********* ********* : ** ****** **** ****** 

CRCV YIVPLCIFNGKFLSNTKYYDDSQYYFNKDTGVIYGFNSTETINTGFDFNCHYLLLPSGNY 

BCV YIVPLCIFNGKFLSNTKYYDDSQYYFNKDTGVIYGLNSTETITTGFDFNCHYLVLPSGNY 

OC43 YIVPLCIFNGKFLSNTKYYDDSQYYFNKDTGVI YGLNSTETITTGFDLNCYYLVLPSGNY 

HECV YIVPLCIFNGKFLSNTKYYDDSQYYFNKDTGVIYGLNSTETITTGFDFNCHYLVLPSGNY 

HEV YIVPLCIFNGKFLSNTKYYDDSQYYFNKDTGVIYGLNSTETITTGFDFNCHYLVLPSGNY 

*********************************** : ****** m **** • ** • ** : *^**^r^r 

CRCV LAI SNELLLTVPTKAI CLNKRKDFT PVQVVDS RWNNARQS DNMTA 

BCV LAI SNELLLTVPTKAI CLNKRKDFT PVQVVDSRWNNARQS DNMTA 

OC43 LAI SNELLLTVPTKAICLNKRKDFTPVQVVDSRWNNARQS DNMTA 

HECV LAI SNELLLTVPTKAI CLNKRKDFT PVQVVDSRWNNARQS DNMTA 

HEV LAI SNELLLTVPTKAI CLNKRKVFT PVQVVDS RWNNARQS DNMTA 
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FIGURE 17 
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